Keratan sulfate specific transporter molecules

ABSTRACT

The present invention relates to isolated molecules, peptides, and polypeptides of specific consensus sequences or structures, and to compounds comprising or consisting of such molecules, peptides or polypeptides, that function as transporter moieties or compositions specifically recognizing the proteoglycan, keratan sulfate. The isolated molecules, peptides, polypeptides and compounds of the invention may be conjugated or otherwise linked to a biologically active moiety (BAM). Thus the BAM conjugates allow the specific targeting and delivery of the BAM, which may be, for example, a peptide, chemical entity or nucleic acid, into the cytoplasm and/or nuclei of keratan-sulfate expressing cells in vitro and in vivo.

The present application is a National Phase application of International Application No. PCT/EP2015/059035, filed Apr. 27, 2015, which claims priority to European Application No. 14165947.4 filed Apr. 25, 2014. The entirety of each application is incorporated by reference herein. The International Application was published in English on Oct. 29, 2015 as Publication No. WO 2015/162287 Al, the entire contents of which are hereby incorporated by reference herein.

1. FIELD OF THE INVENTION

The present invention relates to isolated molecules, peptides, and polypeptides of specific consensus sequences or structures, and to compounds comprising or consisting of such molecules, peptides or polypeptides, that function as transporter moieties or compositions specifically recognizing the proteoglycan, keratan sulfate (KS). The isolated molecules, peptides, polypeptides and compounds of the invention may be conjugated or otherwise linked to a biologically active moiety (BAM). Thus the BAM conjugates allow the specific targeting and delivery of the BAM, which may be, for example, a peptide, chemical entity or nucleic acid, into the cytoplasm and/or nuclei of keratan-sulfate expressing cells in vitro and in vivo.

2. BACKGROUND OF THE INVENTION

Cellular membranes are generally impermeable to macromolecules, including proteins and nucleic acids. Moreover, even smaller molecules may enter living cells only at very low rates and in the presence of high, potentially toxic extracellular concentrations. The lack of means for specifically targeting and delivering a compound of interest into specific cells or tissues has been an obstacle to the therapeutic, prophylactic and diagnostic or experimental use of a potentially large number of biologically active molecules having intracellular sites of action.

Over the past decade various means for intracellular delivery of compounds have been investigated in an attempt to facilitate efficient transfer of a substance of interest from the external medium into tissues or cells. The most common delivery constructs have been based on antibodies (or antibody fragments) or on viral and bacterial peptides discovered to have membrane binding and transport activity. For example, transporter constructs have been investigated based on herpes viral VP22 protein, polypeptides comprising the human immunodeficiency virus (HIV) TAT protein, and polypeptides comprising a homeodomain of an Antennapedia protein (Antp HD), as well as functional fragments and modifications thereof.

The majority of the viral and bacterial peptides investigated in the delivery constructs (also termed cell-penetrating peptides (CPPs)) comprise cationic peptides rich in basic residues such as lysine and/or arginine, or peptides comprising alpha helix enhancing amino acids. CPPs have been used to transfect cells in vitro in some experimental animal models, but have demonstrated limited success in clinical trials. It has been postulated that the lack of success in the clinic may arise from their lack of specificity for any particular cell type or tissue, as well as the inherent instability of these peptides in vivo (often exhibiting half-lives on the order of several minutes). To circumvent the lack of in vivo stability, several stabilized CPPs have been developed which have been chemically modified to resist standard degradation, for example, by modification to contain non-natural amino acids, including “D” amino-acids. For example, a full “D”-retro-inverso form (“D-TAT”) of the archetypal “TAT” peptide has reached clinical Phase 2, but the potentially extremely long persistence of this peptide has limited its uses to topical administration, e.g., to the ear or intraocular for treatment of inflammation of the eye. The systemic administration of such stabilized peptides is contraindicated by virtue of their potential toxicity.

By replacing only specific positions in the “D-TAT” peptide with L-amino acids, peptides have been obtained with an intermediate half-live that are potentially more suitable for clinical development. The transporter constructs disclosed in the applications WO 2010/072406, WO 2010/072228 or WO2010/072275, comprising amino acid sequences with both L- and D-residues exhibit sufficient stability to resist degradation by proteases prior to transport of the cargo moiety to its target site, but do not appear to permanently persist in the cell. Thus, these combination peptides are to some extent subject to protease degradation. Nevertheless, while effective trans-membrane transporter activity has been demonstrated, it has been found that upon uptake, the cargo moiety of the cargo-transporter construct is not readily cleaved from the transporter moiety, which is generally a prerequisite for the cargo moiety to be or become biologically active. Moreover, it has been found that the degradation of the cargo-transporter construct is slow, such that it exhibits the tendency to accumulate in the target cell. Therefore, even if the attached cargo moiety is eventually released or is metabolized, the transporter construct may remain in the cell for a prolonged time and participate in further inter- and intracellular processes leading to unknown and unwanted side effects. Accordingly, there is a need to develop compounds with improved targeting and improved pharmacodynamics for the intracellular delivery of desired cargo moieties.

3. SUMMARY OF THE INVENTION

The inventors have surprisingly discovered that specific consensus sequences of isolated peptides and/or polypeptides are able to effect specific binding to the proteoglycan keratan sulfate (“KS”) that are also selective for KS over other proteoglycans (e.g., heparin, dermatan, C4S, C6S). Peptides and polypeptides of the invention comprising or consisting of the consensus sequences also exhibit improved in vivo stability relative to similarly sized peptides and, thus, are able to function as improved carrier peptides/peptides for the cellular and intracellular delivery of molecules conjugated to them (e.g., by chemical conjugation or recombinant fusion). The peptides and/or polypeptides according to the invention may consist of or comprise the consensus sequences according to (i) or (ii), or the reverses thereof:

(i) Arg₁-X₂-Lys₃-X₄-Arg₅-X₆-Lys₇-(LD₈)_(n)-(XD₉)_(m), or (ii) (XD₋₂)_(m)-(LD₋₁)_(n)-Arg₁-X₂-Lys₃-X₄Arg₅-X₆-Lys₇ wherein,

-   -   (a) Arg₁ and Arg₅ represent L-arginine; Lys₃ and Lys₇ represent         L-lysine; LD₈ or LD⁻¹ represents any L- or D-amino acid other         than D-arginine, D-lysine; L-arginine or L-lysine and n has a         value of 0 to 10; wherein XD₉ or XD⁻² represents any D-amino         acid other than D-arginine or D-lysine and m has a value of 0 or         1; and wherein the remaining amino acids X₂, X₄ and X₆ may be         independently selected from any L- or D-amino acid other than         D-arginine, D-lysine; L-arginine or L-lysine;         or wherein,     -   (b) Arg₁ and Arg₅ represent D-arginine; Lys₃ and Lys₇ represent         D-lysine; LD₈ or LD⁻¹ represents any L- or D-amino acid other         than D-arginine, D-lysine; L-arginine or L-lysine and n has a         value of 0 to 10; wherein XD₉ or XD⁻² represents any D-amino         acid other than D-arginine or D-lysine and m has a value of 0 or         1; and wherein the remaining amino acids X₂, X₄ and X₆ may be         independently selected from any L- or D-amino acid other than         D-arginine, D-lysine; L-arginine or L-lysine.

Consensus sequence (i), above, with conditions (a) is SEQ ID NO:1, and with conditions (b) is SEQ ID NO:2. Consensus sequence (ii), above, with conditions (a) is SEQ ID NO:3, and with conditions (b) is SEQ ID NO:4.

The present inventors have further discovered that, as reflected in the consensus rules above, that the amino acid residues at positions 2, 4, and 6 do not contribute to the binding activity of the peptide. Therefore, the targeting moiety of the above peptides, represented by Arg₁-X₂-Lys₃-X₄-Arg₅-X₆-Lys₇ (SEQ ID NO:5, where a fragment of SEQ ID NO:1 or SEQ ID NO:3; SEQ ID NO:6, where a fragment of SEQ ID NO:2 or SEQ ID NO:4), can be seen to be to be identical for all possible peptides encompassed by the consensus sequences (i) and (ii), differentiated only in following conditions (a) or (b), above. It is the 3D linear arrangement and/or 3D configuration of side chains of the arginine residues at positions 1 and 5 together with those of the lysine residues at positions 3 and 7 that establishes the binding activity of the targeting moiety sequences and contributes the binding specificity and selectively for KS relative to other proteoglycans. The remaining amino acid residues, i.e., at positions 2, 4 and 6, serve primarily as a structural backbone to maintain the proper distance and relative configuration between the arginines at positions 1 and 5, and the lysines at positions 3 and 7 of the targeting moiety. Thus, provided the 3D spatial orientation of these residues is maintained (in particular, the 3D presentation of the amino acid residue side chains), the peptide, polypeptide and/or compositions of the invention will exhibit specific and selective binding for the proteoglycan KS.

As such, the invention is generally drawn to isolated peptides and polypeptides comprising or consisting of the consensus sequences according to (i) or (ii) above, or to compounds comprising or consisting of such peptides and polypeptides. Further, the invention is also directed to equivalent amino acid residue constructs (or compounds comprising them), which construct is defined as having a 3D presentation of the side chains of 2 arginine residues (i.e., Arg₁ and Arg₅) and the side chains of 2 lysine residues (i.e., Lys₃ and Lys₇) equivalent to those in the peptides and/or targeting moieties of consensus sequences (i) and (ii). Accordingly, the invention encompasses the use of both amino acid residues in the freely selectable positions (i.e., 2, 4 and 6 in the targeting moiety of consensus sequences (i) and (ii)), as well as encompasses the replacement of one or more of these amino acid residues with any appropriate chemical linker or chemical spacer suitable to maintain the equivalent spacing and the relative 3D orientation/presentation of the 2 arginine and 2 lysine residues (i.e., Arg₁, Lys₃, Arg₅, and Lys₇), in particular, suitable to maintain the equivalent spacing and 3D orientation of their side chains. Therefore, the invention encompasses molecules comprising or consisting of amino acid residues and also encompasses molecules comprising both amino acid residues and chemical linkers/spacers such that the molecules exhibit 2 arginines and 2 lysines in the same or an equivalent relative 3-dimensional (3D) confirmation as the arginines at positions 1 and 5 and the lysines positions 3 and 7 as in the targeting moieties of consensus sequences (i) and (ii). As used throughout this disclosure, a molecule comprising the replacement of one or more amino acid residues with a chemical linker/spacer is termed a residue-spacer construct.

For the sake of convenience as used throughout this disclosure, the arginines in the residue-spacer construct that correspond to arginines 1 and 5 of the consensus sequences (i) and (ii) will be referenced as Arg1 and Arg5, respectively. Similarly the lysines in the residue-spacer construct that correspond to lysines 3 and 7 of the consensus sequences (i) and (ii) will be referenced as Lys3 and Lys7, respectively.

In view of the foregoing, the invention is generally directed to molecules wherein Arg1 is linked to Lys₃ via one or more chemical linkers (which may consist of or comprise an amino acid residue). In embodiments where Arg1 is linked to Lys3 via a chemical linker that is not an amino acid residue, it is preferred that the linker is such that the distance between Arg1 and Lys3 is 5.8±1.5 Å when the molecule/construct is in extended conformation. The invention also encompasses in the same or alternate embodiments molecules wherein Lys3 is linked to Arg5 via one or more chemical linkers (which may consist of or comprise an amino acid residue); in embodiments where Lys3 is linked to Arg5 via a chemical linker that is not an amino acid residue, it is preferred that the linker is such that the distance between Lys3 and Arg5 is 4.5±1.5 Å when the molecule/construct is in extended conformation. The invention further encompasses in the same or alternate embodiments molecules wherein Arg5 is linked to Lys7 via one or more chemical linkers (which may consist of or comprise an amino acid residue); in embodiments where Arg5 is linked to Lys7 via a chemical linker that is not an amino acid residue, it is preferred that the linker is such that the distance between Arg5 and Lys7 is 8.5±1.5 Å when the molecule/construct is in extended conformation. Accordingly, the invention encompasses embodiments wherein the linkages between Arg1 and Lys3, between Lys3 and Arg5, and between Arg5 and Lys7 are each independently selected from either (a) an amino acid residue (other than D-arginine, D-lysine; L-arginine or L-lysine and according to other rules as outlined herein), or (b) any chemical linker suitable to maintain the relative positions and 3D orientation of the adjoining amino acid residues and their side chains as presented in consensus sequences (i) and/or (ii).

Care must be taken when replacing one or more residues with chemical linkers according to the methods described herein so that the side chains of Arg1, Lys3, Arg5, and Lys7 retain the same or similar 3D presentation as their counterpart side chains in the peptide and polypeptides of consensus sequences (i) and (ii). Thus, the side chains of Arg1, Lys3, Arg5, and Lys7 within a residue-spacer construct according to the invention should present a linear or near linear arrangement within 3D space when the molecule/construct is in extended conformation.

With the understanding that X₂, X₄, and X₆ of consensus sequences (i) and (ii) may each independently represent either (a) a single D- or L-amino acid residue (other than D-arginine, D-lysine; L-arginine or L-lysine and according to the other selection rules as outlined herein), or (b) a chemical linker, the invention may also be described as directed to at least one isolated molecule, or compounds comprising or consisting of said molecule, wherein the molecule has a consensus structure according to either (iii) or (iv), or the reverses thereof:

(iii) Arg₁-(SP_(A))-Lys₃-(SP_(B))-Arg₅-(SP_(C))-Lys₇-(LD₈)_(n)- (XD₉)_(m); or (iv) (XD₋₂)_(m)-(LD₋₁)_(n)-Arg₁-(SP_(A)) Lys₃-(SP_(B))-Arg₅-(SP_(C))- Lys₇; wherein, LD₈ or LD⁻¹ represent any L- or D-amino acid other than D-arginine, D-lysine; L-arginine or L-lysine and n has a value of 0 to 10; wherein XD₉ or XD⁻² represent any D-amino acid other than D-arginine or D-lysine and m has a value of 0 or 1; wherein

-   -   (a) Arg₁ and Arg₅ represent L-arginine, and Lys₃ and Lys₇         represent L-lysine;     -   or     -   (b) Arg₁ and Arg₅ represent D-arginine; and Lys₃ and Lys₇         represent D-lysine;         wherein (SP_(A)) represents a chemical linker that     -   (a) consists of a single amino acid residue, which may be any L-         or D-amino acid residue other than D-arginine, D-lysine,         L-arginine or L-lysine;     -   or     -   (b) separates the adjacent amino acid residues by 5.8±1.5 Å when         the molecule is in extended conformation;         wherein (SP_(B)) represents a chemical linker that     -   (a) consists of a single amino acid residue, which may be any L-         or D-amino acid residue other than D-arginine, D-lysine,         L-arginine or L-lysine;     -   or     -   (b) separates the adjacent amino acid residues by 4.5±1.5 Å when         the molecule is in extended conformation;         and wherein (SP_(C)) represents a chemical linker that     -   (a) consists of a single amino acid residue, which may be any L-         or D-amino acid residue other than D-arginine, D-lysine,         L-arginine or L-lysine;     -   or     -   (b) separates the adjacent amino acid residues by 8.5±1.5 Å when         the molecule is in extended conformation.

When replacing one or more amino acid residue with a chemical spacer/linker in the a consensus sequence (i) or (ii) (e.g., to form one or more of the molecules according to consensus structures (iii) or (iv)), care must be taken such that the side chains of Arg1, Lys3, Arg5, and Lys7 retain the same or similar 3D presentation as their counterpart side chains in the peptide and polypeptide consensus sequences (i) and/or (ii). Thus, the side chains of Arg1, Lys3, Arg5, and Lys7 within residue-spacer constructs according to the invention should present in a linear or near linear arrangement within 3D space when the molecule and/or construct is in extended conformation.

In certain embodiments the consensus structures (iii) or (iv) according to the invention do not comprise any chemical spacer/linkers, but comprises only amino acid residues. In such embodiments, the invention can be defined as directed to at least one isolated peptide or polypeptide, and/or to compounds comprising or consisting of at least one isolated peptide or polypeptide, which peptide or polypeptide has an amino acid sequence according to the consensus sequences (i) and/or (ii) as defined herein.

In certain embodiments, the consensus structures (iii) or (iv) according to the invention comprise chemical linkers/spacers for SP_(A), SP_(B), and SP_(C) that are not and/or do not comprise amino acid residues. In such embodiments, the invention can be defined as directed to at least one residue-spacer construct, or compounds comprising or consisting of them, wherein the construct has a consensus structure according to either (v) or (vi), or the reverses thereof:

(v) Arg₁-(CL_(A))-Lys₃-(CL_(B))-Arg₅-(CL_(C))-Lys₇-(LD₈)_(n)- (XD₉)_(m); or (vi) (XD₋₂)_(m)-(LD₋₁)n-Arg₁-(CL_(A)) Lys₃-(CL_(B))-Arg₅-(CL_(C))- Lys₇ wherein, LD₈ or LD⁻¹ represents any L- or D-amino acid other than D-arginine, D-lysine; L-arginine or L-lysine and n has a value of 0 to 10; wherein XD₉ or XD⁻² represents any D-amino acid other than D-arginine or D-lysine and m has a value of 0 or 1; wherein

-   -   (a) Arg₁ and Arg₅ represent L-arginine, and Lys₃ and Lys₇         represents L-lysine;     -   or     -   (b) Arg₁ and Arg₅ represent D-arginine, and Lys₃ and Lys₇         represent D-lysine;         wherein (CL_(A)) represents a chemical linker that separates the         adjacent amino acid residues by 5.8±1.5 Å when the molecule is         in extended conformation;         wherein (CL_(B)) represents a chemical linker that separates the         adjacent amino acid residues by 4.5±1.5 Å when the molecule is         in extended conformation;         and wherein (CL_(C)) represents a chemical linker that separates         the adjacent amino acid residues by 8.5±1.5 Å when the molecule         is in extended conformation.

The molecules and compounds described above according to consensus sequences (iii) and (iv) (including certain embodiments comprising only amino acid residues according to consensus sequences (i) or (ii) and/or comprising chemical linkers and amino acid residues according to consensus structures (v) or (vi)) find particular use as transport and/or targeting moieties in the molecules and compounds described herein, specifically recognizing and/or binding proteoglycans, e.g., as expressed on the surface of a cell. In particular, the molecules and compounds as described herein bind the proteoglycan KS specifically and selectively over other proteoglycans. The portion of the above consensus sequences (iii) or (iv) acting as the targeting moiety that specifically interacts with the KS proteoglycan is represented by

Arg₁-(SP_(A))-Lys₃-(SP_(B))-Arg₅-(SP_(C))-Lys₇.

These targeting moieties include the specific embodiments comprising only amino acid residues, i.e., Arg₁-X₂-Lys₃-X₄-Arg₅-X₆-Lys₇ (SEQ ID NO:5, where a fragment of SEQ ID NO:1 or SEQ ID NO:3; or SEQ ID NO:6, where a fragment of SEQ ID NO:2 or SEQ ID NO:4). These targeting moieties also include the specific embodiments wherein the freely selectable positions (i.e., (SP_(A)), (SP_(B)) and (SP_(C))) are replaced by chemical linkers/spacers, i.e., Arg₁-(CL_(A))-Lys₃-(CL_(B))-Arg₅-(CL_(C))-Lys₇. As explained herein, these targeting moieties are identical molecules with respect to the 3D presentation of the amino acid residues corresponding to Arg1, Lys3, Arg5, and Lys7, in particular, the side chains of these residues. Accordingly, any molecule comprising 2 arginine residues and 2 lysine residues according to consensus sequences (iii) or (iv) can be developed as the targeting moiety according to the following rules:

-   -   (1) Each of the residues Arg1, Lys3, Arg5, and Lys7 must have         the same chirality, i.e., each must be an L-amino acid or each         must be a D-amino acid.     -   (2) Arg1 must be linked to Lys3 using one or more chemical         spacers, which spacer may be either (a) any L- or D-amino acid         other than D-arginine, D-lysine; L-arginine or L-lysine; or (b)         a chemical spacer that separates the amino acids by 5.8±1.5 Å         when the molecule is in an extended conformation.     -   (3) Lys3 must be linked to Arg5 using one or more chemical         spacers, which spacer may be either (a) any L- or D-amino acid         other than D-arginine, D-lysine; L-arginine or L-lysine; or (b)         a chemical spacer that separates the amino acids by 4.5±1.5 Å         when the molecule is in an extended conformation.     -   (4) Arg5 must be linked to Lys7 using one or more chemical         spacers, which spacer may be either (a) any L- or D-amino acid         other than D-arginine, D-lysine; L-arginine or L-lysine; or (b)         a chemical spacer that separates the amino acids by 8.5±1.5 Å         when the molecule is in an extended conformation.     -   (5) Where one or more of the linkages between Arg1 and Lys3,         Lys3 and Arg5, and Arg5 and Lys7 are not amino acid residues,         i.e., are chemical spacers or linkages, they may comprise any         chemical spacer or linker known in the art and/or described         herein suitable for maintaining the required separation between         Arg1 and Lys3, between Lys3 and Arg5, and between Arg5 and Lys7         as well as suitable to maintain a 3D orientation and/or 3/D         presentation of these amino acids (in particular, their side         chains) similar to or the same as that in the corresponding         residues in consensus sequences (i) and (ii). The linkages         between Arg1 and Lys3, between Lys3 and Arg5, and between Arg5         and Lys7 should preferentially not be positively charged.

For all molecules and compounds of the invention, i.e., comprising or consisting of any of consensus sequences (iii) or (iv) (including embodiments according to consensus sequences (i), (ii), (v) and (vi)) it is further preferred, but not necessary, that LD₈ or LD⁻¹ represents L- or D-histidine and that XD₉ or XD⁻² represents D-histidine.

Where the consensus structures of the invention consist of or comprise exclusively amino acids (i.e., correspond to consensus sequences (i) or (ii) and wherein (SP_(A)), (SP_(B)) and (SP_(C)) according to consensus sequences (iii) or (iv) comprise exclusively amino acids), it is preferred, but not necessary, that the structures do not comprise or consist entirely of amino acid residues having a single chirality. That is, it is preferred, but not necessary, that where the consensus structures of the invention consist of or comprise exclusively amino acid residues, the molecules or compounds of the invention comprise at least one L-amino acid residue and at least one D-amino acid residue within their consensus sequences, i.e., that (SP_(A)), (SP_(B)) and (SP_(C)) together comprise at least one L-amino acid and one D-amino acid. As a non-limiting exemplary embodiment, it is preferred, but not required that where Arg1, Lys3, Arg5 and Lys7 are L-amino acids that the at least one D-amino acid is present at position 4 or position 6, i.e., at X₄ or X₆. In related or separate embodiments, it is further preferred that X₂, X₄ and X₆ are each independently selected from the amino acids isoleucine, tyrosine, phenylalanine, tryptophan, valine, methionine, and leucine. It is also preferred that each of X₂, X₄ and X₆ are not concomitantly alanine and/or that the consensus sequences (i) to (vi) contain no more than three alanine residues. It is also preferred in related or separate embodiments that the consensus sequences (i) to (vi) do not comprise proline and/or that none of the freely selectable positions, X₂, X₄ and X₆, is a proline residue.

Non-limiting examples of peptides that may be used according to the methods disclosed herein and, in particular, as defined hereinabove are provided in Table 1.

TABLE 1 Exemplary Peptides Of The Invention* RYKvRIK (SEQ ID NO: 7) RYKvRIKYF (SEQ ID NO: 8) RYKvRIKYFh (SEQ ID NO: 9) RYKvRIKYA (SEQ ID NO: 10) RYKvRIKYAh (SEQ ID NO: 11) RYKvRIKHH (SEQ ID NO: 12) RYKvRIKHHh (SEQ ID NO: 13) RMKiRVK (SEQ ID NO: 14) RMKiRVKFM (SEQ ID NO: 15) RMKiRVKFMh (SEQ ID NO: 16) RLKlRLKLL (SEQ ID NO: 17) RLKlRLKLLh (SEQ ID NO: 18) RYKvRIKYr (SEQ ID NO: 19) *lower case indicates D-enantiomer; upper case indicates L-enantiomer

In addition to the peptides, polypeptides and compounds comprising or consisting of the molecules according to the consensus sequences (i) or (ii), the invention further encompasses variants of the exemplary peptide sequences explicitly disclosed herein. Such variants comprise substitution X₂, X₄ and X₆, e.g., with a conservative amino acid substitution. As is well known in the art, “conservative substitutions” are substitutions with another amino acid having similar characteristics, e.g., small amino acids substituted for small amino acids, acidic amino acids substituted for acidic amino acids, etc. As known in the art, this principle may be applied to any characteristic class of amino acids e.g., polar amino acids, basic amino acids, hydrophobic amino acids and aromatic amino acids. Preferred substitutions for a particular residue according to the present invention may be chosen from among the other members of its conservative substitution group. Six conservative substitution groups are commonly recognized in the art: (1) alanine (A), glycine (G) serine (S) and threonine (T); (2) aspartic acid (D) and glutamic acid (E); (3) asparagine (N) and glutamine (Q); (4) arginine (R), histidine (H) and lysine (K); (5) isoleucine (I), leucine (L), methionine (M) and valine (V); and (6) phenylalanine (F), tyrosine (Y) and tryptophan (W).

However, the substitutions in X₂, X₄ and X₆ of any peptide sequence according to consensus sequences (i) and (ii) expressly disclosed herein need not be made with a member of its conservative substitution group. The invention also contemplates the substitution of any residue at these positions with any other L- or D-amino acid other than L- or D-arginine or L- or D-lysine subject to the rules of the consensus sequences outlined herein. As detailed herein, positions X₂, X₄ and X₆ do not substantially contribute to targeting effects.

In all embodiments of the invention, use of the amino acid proline (P) in any consensus sequences (i) to (vi), e.g., in any of the chemical linkers/spacers defined therein, should be avoided as this residue would destroy or sufficiently distort the 3-dimensional conformation of the molecule or peptide so as to significantly reduce binding activity or affinity.

Any appropriate chemical linker or chemical spacer suitable to maintain the spacing and the relative 3D orientation/presentation of the 2 arginine and 2 lysine residues (i.e., Arg1, Lys3, Arg5 and Lys7) (and their side chains) equivalent to the corresponding residues in consensus sequences (i) and (ii) known in the art or described herein may be used in the construction of the molecules of the invention according to consensus structures (iii) to (vi). Nonlimiting examples of appropriate chemical linkers/spacers include beta- and gamma-peptides; sugar amino acid based scaffolds; beta-hairpin peptidometics; alpha-helical mimetic, beta-sheet/beta-stand mimetics and beta-turn mimetics and cyclotides. In preferred embodiments, the chemical linker/spacers are not positively charged. The linkers should be subject to metabolic breakdown, non-toxic (including metabolites) and should not significantly reduce the solubility of the compounds of the invention.

It is expressly contemplated that the embodiments described throughout this application, whether identified as preferred (including most preferred) or not, can be independently implemented and/or can be combined with the other disclosed embodiments in the design or selection of a peptide of the invention. Therefore, a sequence or structure of the targeting moiety of the peptide, polypeptide, molecule or compound of the invention must satisfy the rules (1) to (5) listed above, and may optionally satisfy none, one, or more than one of the conditions outlined in the other embodiments disclosed herein whether preferred or not.

The invention may comprise the isolated molecules and compounds (including isolated molecules, peptides, polypeptides and compounds comprising or consisting of the molecules, peptides and/or polypeptides) according to consensus sequences (iii) or (iv) (including embodiments according to consensus sequences (i) and (ii) and consensus structures (v) and (vi)) and may further comprise a biologically active moiety (BAM; also referenced as the “cargo” of the targeting moiety). The (BAM)-(targeting moiety) construct may also be referenced as a BAM-conjugate of the invention throughout the disclosure. The BAM may be chemically conjugated to the compounds, peptides, polypeptides and/or molecules of the invention directly and/or may be linked thereto through a linker group.

As used throughout the disclosure, direct conjugation indicates the conjugation of the BAM moiety to any amino acid residue within consensus sequences (iii) or (iv) (including embodiments according to consensus sequences (i) and (ii) and consensus structures (v) and (vi)), or to any amino acid residue or suitable chemical group therein, using any chemical coupling known in the art or described herein suitable for the conjugation of the BAM moiety to an amino acid residue (e.g., an amino acid side chain) and/or chemical group. Accordingly, direct coupling may result in one or more chemical groups spaced between the BAM moiety and the amino acid (e.g., amino acid side chain) or between the BAM moiety and the chemical group of the residue-spacer construct, which groups form as a result of the coupling reaction as is known in the art.

Alternatively, as described herein, the BAM moiety may be conjugated to any amino acid residue or chemical group within consensus sequences (iii) or (iv) (including embodiments according to consensus sequences (i) and (ii) and consensus structures (v) and (vi)), indirectly, that is, via a linker group. Therefore, as used throughout this disclosure, indirect conjugation means that the BAM is conjugated to the linker group, which linker group is conjugated to an amino acid residue or chemical group within a consensus sequence as defined herein. The conjugation between the BAM and the linker group and between the linker group and an amino acid residue or chemical group of a residue spacer construct or molecule of the invention may be any conjugation method and/or compound suitable for effecting such conjugation as described herein or as is otherwise known in the art.

The direct or indirect conjugation of the BAM moiety may be directed to any amino acid residue or chemical linker/spacer group within the molecules of the invention. Thus, the BAM moiety may be directly or indirectly conjugated to an amino acid residue that is at the N or C terminus of consensus sequence (iii) or (iv) (including embodiments according to consensus sequences (i) and (ii) and consensus structures (v) and (vi)). Alternatively or additionally, the BAM moiety may be directly or indirectly conjugated to an internal amino acid residue or chemical linker/spacer group within molecules of the invention. As used throughout this disclosure, an internal residue or internal chemical group references an amino acid residue or chemical group of consensus sequence (iii) or (iv) that is not at the terminus of the linear peptide chain or linear residue-spacer construct. As is known in the art, conjugation methods (whether direct or indirect) may require the chemical modification of one or both sites of conjugation (e.g., modification of an amino acid residue, modification of a chemical group within the molecule of the invention and/or modification of the BAM moiety). Accordingly, the present invention also encompasses chemical modification of the molecules of the invention, compounds comprising or consisting of the molecules, and/or conjugate components.

The compounds and molecules of the invention may also comprise or consist of a peptide, small molecule or other moiety conjugated to a molecule of the invention wherein the peptide, small molecule or other moiety has a desired biological activity, i.e., is a biologically active moiety (“BAM”). The invention encompasses the use of any BAM known in the art or described herein in any form, e.g., peptide, small molecule or other, provided that the BAM has a desired activity and/or recognized therapeutic effect. Therefore, the compounds of the invention may comprise or consist of peptide or non-peptide BAMs conjugated to molecules and/or compounds of the invention by any means known in the art or described herein. In certain embodiments, the BAM-conjugates of the invention have a BAM conjugated at the terminus of a consensus sequence or structure (i) to (vi) as described herein.

Therefore, in these preferred embodiments, the BAM-conjugate has a structure according to the following consensus structures (vii) or (viii), or the reverses thereof:

(vii) (BAM)-(LINK)-Arg₁-(SP_(A))-Lys₃-(SP_(B))-Arg₅-(SP_(C))- Lys₇-(LD₈)_(n)-(XD₉)_(m); or (viii) (XD₋₂)_(m)-(LD₋₁)_(n)-Arg₁-(SP_(A))-Lys₃-(SP_(B))-Arg₅-(SP_(C))- Lys₇-(LINK)-(BAM) wherein (BAM) represents a biologically active moiety; wherein (LINK) represents an optional linker group; wherein, LD₈ or LD⁻¹ represents any L- or D-amino acid other than D-arginine, D-lysine; L-arginine or L-lysine and n has a value of 0 to 10; wherein XD₉ or XD⁻² represents any D-amino acid other than D-arginine or D-lysine and m has a value of 0 or 1; wherein

-   -   (a) Arg₁ and Arg₅ represent L-arginine; and Lys₃ and Lys₇         represent L-lysine;     -   or     -   (b) Arg₁ and Arg₅ represent D-arginine; and Lys₃ and Lys₇         represent D-lysine;         wherein (SP_(A)) represents a chemical linker that     -   (a) consists of a single amino acid residue, which may be any L-         or D-amino acid residue other than D-arginine, D-lysine,         L-arginine or L-lysine;     -   or     -   (b) separates the adjacent amino acid residues by 5.8±1.5 Å when         the molecule is in extended conformation;         wherein (SP_(B)) represents a chemical linker that     -   (a) consists of a single amino acid residue, which may be any L-         or D-amino acid residue other than D-arginine, D-lysine,         L-arginine or L-lysine;     -   or     -   (b) separates the adjacent amino acid residues by 4.5±1.5 Å when         the molecule is in extended conformation;         and wherein (SP_(C)) represents a chemical linker that     -   (a) consists of a single amino acid residue, which may be any L-         or D-amino acid residue other than D-arginine, D-lysine,         L-arginine or L-lysine;     -   or     -   (b) separates the adjacent amino acid residues by 8.5±1.5 Å when         the molecule is in extended conformation.

Consensus sequences (vii) and (viii) are identical to consensus sequences (iii) and (iv), respectively, but for the presence of the BAM-moiety and the optional linker group, (LINK). Therefore, selection of the chemical spacers (SP_(A)), (SP_(B)) and (SP_(C)), which may consist of an amino acid residue or comprise chemical linkers, proceeds as disclosed herein with respect to consensus sequences (iii) and (iv), including

-   -   selection of these chemical spacers to consist of or comprise         exclusively amino acid residues according to the embodiments         (and combinations thereof) of consensus sequences (i) and (ii)         as detailed throughout this disclosure, and     -   selection of these chemical spacers to comprise both chemical         linkers and amino acid residues according to the embodiments         (and combinations thereof) of consensus structures (iii) to (vi)         as detailed throughout this disclosure.

Where a linker group is present, (e.g., (LINK) in consensus structures (vii) and (viii)) and/or a linker group used in the indirect linkage of the BAM to an amino acid residue or to a chemical group in any of consensus sequences/structures (i) to (vi), such linker may be any linker, e.g., a peptide linker, known in the art or disclosed herein suitable for linking the BAM to the remaining targeting moiety. The BAM may be chemically conjugated to the linking group. For example, where both the linking group, (e.g., (LINK) of structures (vii) and (viii)), and the BAM are peptides or polypeptides, the BAM may be linked to the linking group via a peptide bond and the linking group may also be linked to the targeting moiety of the consensus sequence/structure via a peptide bond. Non-limiting examples of linker groups include peptide linkers, e.g., comprising one or more residues of glutamic acid, glycine, serine, cysteine and combinations thereof. In certain embodiments, the linking group (LINK) of structures (vii) and (viii) is a single amino acid that is L- or D-glutamic acid.

The invention also encompasses molecules and compounds comprising or consisting of the molecules, e.g., peptide-conjugates/residue-spacer-conjugates/BAM-conjugates, that do not comprise a linking group, i.e., wherein consensus structures (vii) and (viii) lack the (LINK) moiety and/or comprise direct linkage of the BAM moiety to any amino acid residue or chemical group within any consensus sequence/structure (i) to (vi). Where the BAM-conjugate of the invention is lacking the linking group, the BAM may be conjugated, e.g., chemically conjugated, directly to the molecules' amino acid residue or chemical group (e.g., a chemical group of the chemical linker/spacer). Non-limiting examples of such chemical conjugation include covalent attachment to the molecule at the N-terminus and/or to the N-terminal amino acid residue via an amide bond or at the C-terminus and/or C-terminal amino acid residue via an ester bond. Where the BAM is a peptide or polypeptide, the BAM may be directly conjugated to the N-terminus and/or N-terminal amino acid or to the C-terminus and/or C-terminal amino acid via a peptide bond.

The invention encompasses any BAM expected to exert a therapeutically relevant activity on administration to an organism or on delivery to one or more cells of an organism, whether in vitro or in vivo. Accordingly, non-limiting examples of BAMs encompassed by the invention include mono- and poly-saccharides, cytotoxic agents, antineoplastic agents, anti-inflammatory agents, anti-viral agents, anti-bacterial agents, agents for the treatment of protozoan infections, and hormonal agents. The BAM may also be a deoxyribose or ribose.

Non-limiting examples of anti-neoplastic agents that may be used as BAMs according to the methods of the invention include, but are not limited to, aldesleukin, alemtuzumab, alitretinoin, allopurinol, altretamine, amifostine, anastrozole, abiraterone, arsenic, axitinib, azacitidine, bendamustine, bexarotene, bleomycin, bortezomib, busulfan, cabazitaxel, calusterone, capecitabine, carboplatin, carfilzomib, carmustine, carmustine, celecoxib, chlorambucil, cisplatin, cladribine, clofarabine, crizotinib, cyclophosphamide, cytarabine, dacarbazine, dactinomycin, actinomycin D, dasatinib, daunorubicin, decitabine, dexrazoxane, docetaxel, doxorubicin, entinostat, epirubicin, eribulin, erlotinib, estramustine, etoposide, everolimus, exemestane, fostamatinib, floxuridine, fludarabine, fluorouracil, 5-FU, fulvestrant, gefitinib, gemcitabine, hydroxyurea, idarubicin, lenalidomide, ifosfamide, imatinib, iomustine, irinotecan, isotretinoin, ixabepilone, lapatinib, lenalidomide, letrozole, leucovorin, levamisole, lomustine, CCNU, marizomib, meclorethamine, nitrogen mustard, melphalan, L-PAM, mercaptopurine, 6-MP, mertansine, mesna, methotrexate, methoxsalen, mitomycin, mitotane, mitoxantrone, nandrolone, nelarabine, nilotinib, oxaliplatin, paclitaxel, pamidronate, pazopanib, pegademase, pemetrexed, pentostatin, pipobroman, plerixafor, plicamycin, mithramycin, porfimer, pralatrexate, procarbazine, quinacrine, rapamycin, romidepsin, ruxolitinib, sorafenib, streptozocin, sunitinib, tamoxifen, temozolomide, temsirolimus, teniposide, VM-26, testolactone, thalidomide, thioguanine, 6-TG, thiotepa, topotecan, toremifene, tretinoin, ATRA, uracil mustard, valrubicin, vandetanib, vemurafenib, verteporfin, vinblastine, vincristine, vinorelbine, vismodegib, vorinostat, zoledronate, nucleoside analogues AZT, b-D-arabinofuranose, vidarabine, 2-chlorodeoxyadenosine, intercalating drugs, kinase inhibitors, cofarabine, laromustine, clophosphamide, asparaginase, dexamethasone, prednisone and lestaurtinib. The above-listed anti-neoplastic agents may be used in accordance with the methods disclosed herein not only in connection with neoplastic diseases, but also in the treatment, prevention and/or amelioration of other diseases, or symptoms thereof, as is known in the art, e.g., in connection with the treatment, prevention and/or amelioration of anti-inflamatory or autoimmune diseases, and/or symptoms thereof.

Examples of anti-inflammatory agents that may be used as BAMs according to the methods of the invention include, but are not limited to, COX-2 inhibitors, prednisone, pazopanib, famotidine, dalfampridine, pegloticase, esomeprazole, aspirin, celecoxib, diclofenac, valdecoxib, rofecoxib, diflunisal, etodolac, fenoprofen, flurbiprofen, ibuprofen, indomethacin, ketoprofen, ketorolac, mefenamic acid, meloxicam, nabumetone, naproxen, oxaprozin, piroxicam, sulindac, tolmetin, lansoprazole, meclofenamate, triamcinolone, methylprednisolone, betamethasone, budesonide, prednisolone, hydrocortisone, dexamethasone and cortisone.

Examples of anti-protozoal agents that may be used as BAMs according to the methods of the invention include, but are not limited to, chloroquine, mefloquine, primaquine, proguanil hydrochloride, proguanil hydrochloride with atovaquone, pyrimethamine, sulfadoxine, quinine, quinoline, doxycycline, clindamycin, artesunate, diloxanide, metronidazole, tinidazole, mepacrine hydrochloride, amphotericin, pentamidine, pyrimethamine, sulfadiazine, azithromycin, atovaquone, trimethoprim-sulphamethoxazole, trimethoprim, dapsone, atovaquone, pentamidine isetionate, amodiaquine, chloroguanide, eflornithine, hydroxychloroquine, iodoquinol, meglumine antimonate, melarsoprol, nifurtimox, paromomycin, sodium stibogluconate, suramin, and tryparsamide.

Examples of hormonal agents that may be used as BAMs according to the methods of the invention include, but are not limited to, afimoxifene, arzoxifene, bazedoxifene, clomifene, fulvestrant, femarelle, lasofoxifene, ospemifene, ormeloxifene, tamoxifen, toremifene, raloxifene, dienogest, mifepristone and ulipristal acetate.

As detailed herein, the compounds of the invention comprise or consist of a BAM-conjugate (i.e., a BAM conjugated to the targeting moiety; also referenced as a peptide-conjugate) that may effect intracellular transport of the BAM. Accordingly, in preferred embodiments, the invention encompasses a pharmaceutical composition comprising a compound of the invention, e.g., a BAM-conjugate, and a pharmaceutically acceptable carrier. The pharmaceutical compositions of the invention comprising, e.g., the BAM-conjugate, may be used for any indication known or predicted to be treatable with the BAM. For example, where the BAM is an anti-viral agent, the pharmaceutical composition comprising the BAM conjugate may be used in the treatment of a viral infection, or a symptom thereof, including but not limited to HIV; Epstein Barr virus; morbillivirus; paramyxovirus; rubivirus; herpes virus; dengue virus; herpes simplex virus; parvovirus; respiratory syncytial virus; variola virus; varicella; flavivirus; human T-lymphotropic virus; hepatitis virus A, B, C, D or E, lassa virus and/or influenza virus. Where the BAM is an anti-protozoal agent, the pharmaceutical composition comprising the BAM-conjugate may be used in the treatment of protozoal infections, such as leshmaniasis and/or malaria.

The targeting moieties of consensus sequences/structures (i) to (viii) interact specifically with proteoglycans, in particular, keratan-sulfate (KS). KS is preferentially expressed relative to other proteoglycans, by certain tissues, for example, tissues of the eye (e.g., cornea), the central nervous system, and the endometrial lining of the uterus. Accordingly, the invention encompasses the use of the molecules and compounds, e.g., BAM-conjugates, to specifically target cells expressing KS and/or tissues comprising such cells. The invention also encompasses the use of the molecules and compounds, e.g., BAM-conjugates, to facilitate transport of the BAM moiety into cells expressing KS, e.g., corneal cells.

The molecules and compounds of the invention (including isolated molecules peptides, polypeptides and compounds comprising or consisting of any of consensus sequences (i) to (viii)) may find particular use in the treatment of KS accumulation diseases such as morquio syndrome. The molecules and compounds of the invention may also find particular use in the targeting of tissues or cells of the eye, e.g., corneal cells. Thus, the molecules of the invention may find particular use in the treatment, prevention or amelioration of symptoms associated with diseases or conditions in which these cells are pathophysiologically involved or have pathophysiological involvement. Nonlimiting examples of diseases or conditions affecting corneal cells that may be treated using the molecules and compound of the invention include corneal ulcer, corneal flash burn, corneal abrasion, corneal neovascularization, keratitis, ocular herpes, pterygium, trachomacorneal abrasion, corneal dystrophy, Fuch's dystrophy, iridocorneal endothelial syndrome, lattice dystrophy, map-dot-fingerprint dystrophy, corneal infection, ocular herpes, and dry eye.

The molecules and compounds of the invention (including isolated molecules peptides, polypeptides and compounds comprising or consisting of any of consensus sequences (i) to (viii)) may find particular use in the targeting of cells and tissues of the retina. Thus, the molecules and compounds of the invention may find particular use in the treatment, prevention or amelioration of symptoms associated with diseases or conditions affecting such cells. Non-limiting examples of such diseases and conditions affecting the retina that may be treated using the molecules and compounds of the invention include age-related macular degeneration, diabetic retinopathy, retinitis pigmentosa, retinal degeneration, macular hole, retinoblastoma, glaucoma and onchocerciasis.

The molecules and compounds of the invention (including isolated molecules peptides, polypeptides and compounds comprising or consisting of any of consensus sequences (i) to (viii)) find particular use in the targeting of cells of the central nervous system. Thus, the molecules of the invention may find particular use in the treatment, prevention or amelioration of symptoms associated with diseases or conditions affecting such cells. Nonlimiting examples of such diseases or conditions that may be treated using the molecules and compound of the invention include anxiety disorders, depression, epilepsy, spinal muscular atrophy, amyotrophic lateral sclerosis, Alzheimer's disease, autism, Ataxia Telangiectasia, Niemann Pick disease Type C, cerebellar degeneration, Machado-Joseph disease, olivopontocerebella atrophy, spinocerebella ataxias, sporadic ataxia, multiple system atrophy, Parkinson's disease, Parkison's syndrome, gait ataxia, herpes zoster, viral encephalitis, Japanese encephalitis, bacterial encephalitis, toxoplasmosis, malaria, amoebic meningoencephalitis, Cryptococcus neoformans encephalitis, Lyme disease, streptococci meningoencephalitis, staphylococci meningoencephalitis, astrocytoma, glioblastoma, oligodendroglioma, ependymoma, medulloblastoma and meningioma.

The molecules and compounds of the invention (including isolated molecules peptides, polypeptides and compounds comprising or consisting of any of consensus sequences (i) to (viii)) find particular use in the targeting of cells of the endometrial lining of the uterus. Thus, the molecules of the invention may find particular use in the treatment, prevention, regulation of physiological function or amelioration of symptoms associated with diseases, conditions or physiological functions related to such cells and tissues. Nonlimiting examples of such diseases, conditions or physiological functions that may be treated using the molecules and compounds of the invention include endometrial cancer, endometrial hyperplasia, endometrial polyp, uterine bleeding, endometriosis, adenomyosis, cervicitis, infertility, contraception, and regulation of the menstrual cycle.

4. DEFINITIONS

The term “chemical derivatives” as used herein refers to the (chemical) modification of amino acids, amino acid side chains and peptide bonds (including those at the N-terminus, at the C-terminus, and within the backbone of the consensus structure/sequence) as well as modification of any chemical group within the chemical spacer/linker of consensus structures (iii) to (vi). The term is not intended to refer to any addition, substitution or deletion of amino acids residues in an amino acid or peptide chain. Chemical derivatives from L-amino acids or L-enantiomeric amino acids typically comprise any naturally or non-naturally occurring derivative of these amino acids, including, without being limited thereto, amino acids as defined above comprising post-translational modifications or synthetic modifications, including acetylation (at the N-terminus of the (poly-)peptide sequence, at lysine residues, etc.), deacetylation, alkylation (such as methylation, ethylation, etc. (preferably at lysine or arginine residues within the (poly-)peptide sequence)), dealkylation (such as demethylation, deethylation, etc., amidation (preferably at the C-terminus of the (poly-)peptide sequence)), formylation, gamma-carboxylation, glutamylation, glycosylation (preferably at asparagine, lysine, hydroxylysine, serine or threonine residues, etc., within the (poly-)peptide sequence), addition of a heme or haem moiety, hydroxylation, iodination, isoprenylation addition of an isoprenoid moiety such as farnesyl or geranylgeraniol, etc.), lipoylation (attachment of lipoate functionality), such as prenylation, formation of a GPI anchor, including myristoylation, farnesylation, geranylgernaylation, etc., oxidation, phosphorylation (e.g. to a serine, tytosine, threonine or a histidine moiety, etc., within the (poly-)peptide sequence), sulfation (e.g., of tyrosine), selenoylation, sulfation, etc. Chemical derivatives of amino acids also include, without being limited thereto, modified amino acids, which have been modified by introducing a label, including radioactive labels, a dye or fluorescent group, or a chemoluminesent group.

As used herein, the term “freely selectable positions”, “freely selectable amino acids” and analogous terms reference the amino acid residues (if present) at the positions corresponding to those positions in consensus sequences (i) and (ii) that are not explicitly defined. The targeting moiety of consensus sequences (i) and (ii) is represented by Arg₁-X₂-Lys₃-X₄-Arg₅-X₆-Lys₇, wherein Arg₁ and Arg₅ represent D- or L-arginine (all of the same chirality), and Lys₃ and Lys₇ represent D- or L-lysine (both having the same chirality as Arg₁ and Arg₅) wherein the remaining amino acids, i.e., the freely selectable positions, X₂, X₄ and X₆ may be any L- or D-amino acid other than L-lysine, D-lysine, L-arginine or D-arginine. The term freely selectable positions also references those amino acid residues (where present) in the corresponding positions in the residue-spacer constructs of the invention. As further defined herein, the invention also encompasses the replacement of the freely-selectable amino acids with chemical spacers to form the residue-spacer constructs such that the relative 3D orientation of the 2 arginines and 2 lysines is equivalent to that of Arg₁, Lys₃, Arg₅ and Lys₇ of consensus sequences (i) and (ii). The chemical spacers/linkers that may replace the freely selectable amino acid in the molecules of the invention (e.g., a residue-spacer construct) are represented by (SP_(A)), (SP_(B)), and (SP_(C)) in consensus structures (iii) and (iv), and (vii) and (viii); and by (CL_(A)), (CL_(B)) and (CL_(C)) in consensus structures (v) and (vi).

In the context of the present invention, L-amino acids, also known in the art and referenced herein as L-enantiomeric amino acids, are preferably amino acids selected from natively occurring amino acids or their derivatives. Naturally occurring amino acids are recognized as the standard (proteinogenic) amino acids alanine, arginine, asparagine, aspartic acid, cysteine, glutamine, glutaminic acid, glycine, histidine, isoleucine, leucine, lysine, methionine, phenyl alanine, proline, serine, threonine, tryptophan, tyrosine, and valine, but also include non-standard amino acids such as ornithine, citrulline, homocysteine, S-adenosyl methionione, hydroxyproline, selenocysteine, pyrrolysine, lanthionine, 2-aminoisobutyric acid, dehydroalanine, and gamma-aminobutyric acid.

Similarly, in the context of the present invention, D-amino acids, also known in the art and referenced herein as D-enantiomeric amino acids, are preferably non-native (non-proteinogenic) “retro-inverso” amino acids, wherein these non-native (non-proteinogenic) “retro-inverso” amino acids are recognized as derived from naturally occurring L-amino acids and/or their derivatives as defined above. In this context, the term “retro-inverso” refers to an isomer of a naturally occurring L-amino acid as defined above (and peptides made therefrom) in which the chirality of the naturally occurring L-amino acid residue is inverted in the corresponding D-amino acid. In other words, in the peptide bonds of D-amino acids the positions of carbonyl and amino groups are exchanged, while the position of the side-chain groups at each alpha carbon is preserved. Accordingly, D-amino acids may be inserted into a peptide sequence consisting of or comprising L-amino acids and therefore may be conjugated with L-amino acids as defined above by methods known in the art or defined herein.

Subject to the remaining selection rules as defined herein, where the consensus structures of the invention consist of or comprise exclusively amino acid residues, it is preferred, but not required, that the molecules and compounds of the invention corresponding to these consensus structures comprise at least one L-amino acid residue and at least one D-amino acid residue.

The amino acids abbreviations used in the present disclosure are shown in the following Table of Correspondence. As used throughout the present disclosure and as detailed in the following table, upper-case or a capital letter in the 1 letter symbol references an L-amino acid, while a lowercase 1 letter symbol references a D-amino acid.

Symbol 1-Letter L-amino D-amino acid acid 3-Letter Amino acid A a Ala Alanine R r Arg Arginine N n Asn Asparagine D d Asp Aspartic acid C c Cys Cysteine E e Glu Glutamic acid Q q Gln Glutamine G g Gly Glycine H h His Histidine I i Ile Isoleucine L l Leu Leucine K k Lys Lysine M m Met Methionine F f Phe Phenylalanine P p Pro Proline S s Ser Serine T t Thr Threonine W w Trp Tryptophan Y y Tyr Tyrosine V v Val Valine X x Xaa Unknown or other

As used herein, the term “peptides”, “polypeptides” and “proteins” have their meaning as generally understood as in the art, i.e., referring to a chain of amino acids. However, the terms shall not be construed as limiting the length of the amino acid chain.

5. BRIEF DESCRIPTION OF FIGURES

FIG. 1 Schematic detailing the average distances between potentially interacting chemical residues of TAT and KS. The boxed numbers identify the residue numbering for each molecule used in the simulations and as described herein.

FIG. 2 Representation of TAT peptide in extended conformation, indicating calculated distances between certain chemical groups according to model predictions.

FIG. 3 Representation of KS molecule in extended conformation, indicating calculated distances between certain chemical groups according to model predictions.

FIG. 4 Binding of a peptide of the invention having the sequence RYKvRIK (“RYK-KS”; SEQ ID NO:7) to various proteoglycans (4A) as compared to the binding of the TAT peptide (4B); HS: heparan sulfate; C4S: chondroitin-4-sulfate; C6S: chondroitin-6-sulfate; KS: keratan-sulfate. The RYK peptide of the invention specifically and selectively binds KS over other proteoglycans.

FIG. 5 (5A) Binding of peptides according to the invention to KS. Upper case letters indicate L-enantiomers and lower case letters indicate D-enantiomers. The tested peptides are RKKRRQRR (SEQ ID NO:20), RRRRRRRRR (SEQ ID NO:21), RYKvRIKYFh (SEQ ID NO:9), RYKvRIKYAh (SEQ ID NO:11), RYKvRIKYr (SEQ ID NO:19), RYKvRIK (SEQ ID NO:7), RMKiRVK (SEQ ID NO:14), RMKiRVKFMh (SEQ ID NO:16), RYKvRIKHHh (SEQ ID NO:13), RLKIRLKLLh (SEQ ID NO:18), RYKvRPKYF (SEQ ID NO:22), and RYKvFIKPRh (SEQ ID NO:23). (5B) Binding of peptides according to the invention and controls to C4S, C6S, KS, HS and HA. The tested peptides are “RYRI” (RYKvRIK; SEQ ID NO:7), RPKHRPK (SEQ ID NO:26), and IRIkIrlK (SEQ ID NO:27). Binding to KS is maintained where consensus sequences (i) and/or (ii) are followed, i.e., positions 1 and 5 are arginines, and positions 3 and 7 are lysines, each of the residues at positions 1, 3, 5 and 7 having the same chirality; binding is also maintained on mutation of positions 2, 4 and 6 to comprise various residues except proline.

FIG. 6 CHST1 expression data from Atlas arrays.

FIG. 7 CHST6 expression data from Atlas arrays.

FIG. 8 Percent viability of MDCK cell cultures following exposure to a model BAM-conjugate. Viability was determined by trypan blue exclusion following 96 hour incubation with different concentrations of 3′AdT conjugated to the amino-terminus of the peptide “RYRI” (RYKvRIK (SEQ ID NO:7)) by a single D-Glu residue (3′AdT-e-RYRI).

6. DETAILED DESCRIPTION OF THE INVENTION

The present invention relates to molecules, peptides, polypeptides and compounds comprising or consisting of such molecules, peptides or polypeptides having specific consensus sequences or structures and their use for specific transport of biologically active moieties (BAMs) to the intercellular environment, i.e., the cytoplasm and/or the nucleus. The inventors have surprisingly discovered that certain consensus structures or sequences of molecules and peptides are able to selectively target specific proteoglycans, in particular keratan-sulfate (KS). Accordingly, the molecules, proteins, polypeptides and compounds comprising or consisting of such molecules, peptide or polypeptide comprising or consisting of one or more of the consensus sequences disclosed herein may find particular use as transport moieties for the targeted delivery of BAMs to cells expressing KS.

Although the binding of certain peptides to proteoglycans and phospholipids is known, it has previously been viewed as a non-specific process driven primarily though ubiquitous hydrostatic interactions. As such, it has not been previously suggested that such interactions can be manipulated to impart selectivity, preventing the peptides from being used for specific targeting. Despite the perceived ubiquitous nature of these interactions, the present inventors have surprisingly discovered that specific peptide sequences can be designed that exhibit selectivity for certain proteoglycans over phospholipids in general. Specifically, the present inventors have discovered that a peptide comprising a 7 residue sequence having arginines at positions 1 and 5, and lysines at positions 3 and 7, or a molecular construct exhibiting an equivalent relative 3D confirmation of these 4 residues (e.g., a residue-spacer construct), optimally interacts via electrostatic bridging with the sulfates and carboxylates on the aminoglycan units of KS. Moreover, including non-charged amino acids or non-positively charged chemical spacers in the remaining positions provides selectivity, inhibiting or substantially reducing the non-specific interaction of the peptide with other proteoglycans and/or phospholipids. Thus, the unique combination of residues and/or 3D presentation of specific residues allow peptides and residue-spacer constructs to be designed that exhibit selective binding for KS proteoglycans over other proteoglycans and other phospholipids.

The remaining residues within the 7-mer, i.e., the residues at positions 2, 4 and 6, are not predicted by the model to be involved in the binding of the peptide to KS, and, provided the three dimensional linear conformation of residues 1, 3, 5, and 7 is not disturbed in the peptide, substitution with a wide variety of amino acids (including non-natural amino acids, such as D-amino acids) at these freely selectable positions had little to no effect on the specific binding to KS. Accordingly, the inventors surprisingly discovered that provided the 3D presentation and/or orientation of these 2 arginine residues and 2 lysine residues is maintained in the molecule, KS specific binding will also be maintained. Therefore, the invention encompasses the replacement of the residues at these freely-selectable positions with chemical spacers provided that the relative 3D orientation of the arginines at positions 1 and 5, and the lysines at positions 3 or 7 is maintained so as to be equivalent to their presentation in consensus sequences (i) and/or (ii) (in particular with respect to their side chains). However, the use of the charged amino acids arginine or lysine in these freely selectable positions (where amino acid residues are present) and/or charged linking groups, in particular positively charged groups, increased the likelihood of non-specific interactions with other proteoglycans and phospholipids. Therefore, to maintain specificity, the use of arginine and lysine in the freely selectable positions is avoided in the peptides of the invention and/or the use of charged linking groups is avoided where one or more of the freely selectable residues is replaced by a chemical linker in the residue-linker constructs of the invention.

Additionally, it was discovered that specificity can be maintained using D-amino acids in the molecules of the invention (e.g., peptides or residue-spacer constructs) at the positions relevant for KS binding, i.e., Arg1, Lys3, Arg5, and Lys7, provided that all positions have the same chirality. It is believed that the use of a single chirality at these positions presents the same three-dimensional structure of the amino acid side chains. Where one or more of these positions has a differing chirality, the three-dimensional linear arrangement is distorted. Thus, the invention encompasses a molecule comprising 2 arginine residues and 2 lysine residues, wherein each of the 4 residues is a L-residue or wherein each of the 4 residues is a D-residue, provided that the molecule presents the residues in the same or similar 3D orientation and configuration as Arg1, Lys3, Arg5, and Lys7 in consensus sequences (i) and/or (ii) (in particular, when the molecules are in solution).

The consensus sequences of the molecules of the invention are preferably envisioned as transporter moieties, of use in the intercellular transport of associated cargo molecules/moieties. Proteoglycans and/or molecules comprising proteoglycans on the cellular surface are known to be cycled via endosomes. To favor release of the compound of the invention from the endosome (and help further avoid degradation), one to several amino acid residues capable of acting as a proton sponge can optionally be added to the N- or C-terminus of the transporter moiety. The proton sponge increases the osmolarity of the endosome, causing swelling and rupture, leading to cytosolic release of its contents rather than their degradation. Any amino acid residue or combination of residues known in the art or described herein capable of functioning as a proton sponge can be used in accordance with the invention. The final amino acid of this optional series of proton sponge resides (i.e., the final N-terminal or C-terminal residue of the proton sponge residues (where present)) may also optionally be a D-entantiomer to avoid degradation/cleavage of the proton sponge residues. Accordingly, the invention encompasses the optional use of one or more amino acid residues added to the N- or C-terminus of the transporter moiety, wherein the N- or C-terminal residue, respectively, is optionally a D-entantiomer. In preferred embodiments, the one or more residues of the proton sponge residues is(are) not lysine or arginine residues. The optional residues of the proton sponge as defined herein are represented in the consensus sequences (i) to (viii) of the invention by LD₈ or LD⁻¹. The optional N- or C-terminal residue of the proton sponge is represented in the consensus sequences (i) to (viii) of the invention by XD₉ or XD⁻². Thus, in preferred embodiments, LD₈ or LD⁻¹ of consensus sequences (i) to (viii) (where present) represents any L- or D-amino acid other than L- or D-arginine or L- or D-lysine; in connection with these preferred embodiments or as independent embodiments, XD₉ or XD⁻² of consensus sequences (i) to (viii) (where present) is preferentially any D-amino acid other than D-arginine or D-lysine. In the most preferred embodiments, the one or more residues of the proton sponge residues is(are) histidine residues. Thus, in the most preferred embodiments, LD₈ or LD⁻¹ of consensus sequences (i) to (viii) represents L- or D-histidine; in connection with these most preferred embodiments or as independent embodiments, XD₉ or XD⁻² of consensus sequences (i) to (viii) is most preferentially D-histidine.

6.1 Molecule Consensus Structures

The above discoveries of the inventors may be presented as a consensus structure.

The invention is generally directed to at least one isolated molecule, or compounds comprising or consisting of said molecule, wherein the molecule has a consensus structure according to (iii) or (iv), or the reverses thereof:

(iii) Arg₁-(SP_(A))-Lys₃-(SP_(B))-Arg₅-(SP_(C))-Lys₇-(LD₈)_(n)- (XD₉)_(m); or (iv) (XD₋₂)_(m)-(LD₋₁)_(n)-Arg₁-(SP_(A))-Lys₃-(SP_(B))-Arg₅-(SP_(C))- Lys₇ wherein, LD₈ or LD⁻¹ represents any L- or D-amino acid other than D-arginine, D-lysine; L-arginine or L-lysine and n has a value of 0 to 10; wherein XD₁₁ or XD-2 represents any D-amino acid other than D-arginine or D-lysine and m has a value of 0 or 1; wherein

-   -   (a) Arg₁ and Arg₅ represent L-arginine, and Lys₃ and Lys₇         represent L-lysine;     -   or     -   (b) Arg₁ and Arg₅ represent D-arginine; and Lys₃ and Lys₇         represent D-lysine;         wherein (SP_(A)) represents a chemical linker that     -   (a) consists of a single amino acid residue, which may be any L-         or D-amino acid residue other than D-arginine, D-lysine,         L-arginine or L-lysine;     -   or     -   (b) separates the adjacent amino acid residues by 5.8±1.5 Å when         the molecule is in extended conformation;         wherein (SP_(B)) represents a chemical linker that     -   (a) consists of a single amino acid residue, which may be any L-         or D-amino acid residue other than D-arginine, D-lysine,         L-arginine or L-lysine;     -   or     -   (b) separates the adjacent amino acid residues by 4.5±1.5 Å when         the molecule is in extended conformation;         and wherein (SP_(C)) represents a chemical linker that     -   (a) consists of a single amino acid residue, which may be any L-         or D-amino acid residue other than D-arginine, D-lysine,         L-arginine or L-lysine;     -   or     -   (b) separates the adjacent amino acid residues by 8.5±1.5 Å when         the molecule is in extended conformation.

When the molecule or compound of the invention comprises one or more chemical spacers/linkers that is not an amino acid residue, care must be taken such that the side chains of Arg1, Lys3, Arg5 and Lys7 retain the same or similar 3D presentation as their counterpart side chains in the peptide and/or polypeptide consensus sequences (i) and/or (ii) as defined herein. Thus, the side chains of Arg1, Lys3, Arg5 and Lys7 within residue-spacer constructs according to the methods of the invention should present in a linear or near linear arrangement within 3D space when the molecule and/or construct is in extended conformation.

6.2 Peptide Consensus Sequences

In embodiments of the invention wherein the molecules and/or compounds according to consensus structures (iii) or (iv) comprise or consist only of amino acid residues, the consensus structures may be represented by consensus sequences of a peptide or polypeptide. Accordingly, in certain embodiments, the present invention is directed to compounds comprising or consisting of at least one isolated peptide or polypeptide having an amino acid sequence according to the following consensus sequence (i) or (ii), or the reverses thereof:

(i) Arg₁-X₂-Lys₃-X₄-Arg₅-X₆-Lys₇ (LD₈)_(n)-(XD₉)_(m), or (ii) (XD₋₂)_(m)-(LD₋₁)_(n)-Arg₁-X₂-Lys₃-X₄-Arg₅-X₅-Lys₇ wherein,

-   -   (a) Arg₁ and Arg₅ represent L-arginine; Lys₃ and Lys₇ represent         L-lysine; LD₈ or LD⁻¹ represents any L- or D-amino acid other         than D-arginine, D-lysine; L-arginine or L-lysine and n has a         value of 0 to 10; wherein XD₉ or XD⁻² represents any D-amino         acid other than D-arginine or D-lysine and m has a value of 0 or         1; and wherein the remaining amino acids X₂, X₄ and X₆ may be         any L- or D-amino acid other than D-arginine, D-lysine;         L-arginine or L-lysine;         or wherein,     -   (b) Arg₁ and Arg₅ represent D-arginine; Lys₃ and Lys₇ represent         D-lysine; LD₈ or LD⁻¹ represents any L- or D-amino acid other         than D-arginine, D-lysine; L-arginine or L-lysine and n has a         value of 0 to 10; wherein XD₉ or XD⁻² represents any D-amino         acid other than D-arginine or D-lysine and m has a value of 0 or         1; and wherein the remaining amino acids X₂, X₄ and X₆ may be         any L- or D-amino acid other than D-arginine, D-lysine;         L-arginine or L-lysine.

Consensus sequence (i), above, with conditions (a) is SEQ ID NO:1, and with conditions (b) is SEQ ID NO:2. Consensus sequence (ii), above, with conditions (a) is SEQ ID NO:3, and with conditions (b) is SEQ ID NO:4.

6.3 Residue-Spacer Consensus Structures

In embodiments of the invention wherein the chemical linkers/spacers represented by SP_(A), SP_(B), and SP_(C) in consensus structures (iii) or (iv) do not comprise amino acid residues, the consensus structures may be represented by consensus structures for a residue-spacer construct. In such embodiments, the invention can be defined as directed to at least one residue-spacer construct, or compounds comprising or consisting of them, wherein the construct has a consensus structure according to (v) or (vi), or the reverses thereof:

(v) Arg₁-(CL_(A))-Lys₃-(CL_(B))-Arg₅-(CL_(C))-Lys₇-(LD₈)_(n)- (XD₉)_(m); or (vi) (XD₋₂)_(m)-(LD₋₁)_(n)-Arg₁-(CL_(A)) Lys₃-(CL_(B))-Arg₅-(CL_(C))- Lys₇ wherein, LD₈ or LD⁻¹ represents any L- or D-amino acid other than D-arginine, D-lysine; L-arginine or L-lysine and n has a value of 0 to 10; wherein XD₉ or XD⁻² represents any D-amino acid other than D-arginine or D-lysine and m has a value of 0 or 1; wherein

-   -   (a) Arg₁ and Arg₅ represent L-arginine, and Lys₃ and Lys₇         represents L-lysine;     -   or     -   (b) Arg₁ and Arg₅ represent D-arginine, and Lys₃ and Lys₇         represent D-lysine;         wherein (CL_(A)) represents a chemical linker that separates the         adjacent amino acid residues by 5.8±1.5 Å when the molecule is         in extended conformation;         wherein (CL_(B)) represents a chemical linker that separates the         adjacent amino acid residues by 4.5±1.5 Å when the molecule is         in extended conformation;         and wherein (CL_(C)) represents a chemical linker that separates         the adjacent amino acid residues by 8.5±1.5 Å when the molecule         is in extended conformation.

When the molecule or compound of the invention comprises one or more chemical spacers/linkers that is not exclusively comprised of amino acid residues, care must be taken such that the side chains of Arg1, Lys3, Arg5, and Lys7 retain the same or similar 3D presentation as their counterpart side chains in the peptide and/or polypeptide consensus sequences (i) and/or (ii). Thus, the side chains of Arg1, Lys3, Arg5, and Lys7 within residue-spacer constructs according to the methods of the invention should present in a linear or near linear arrangement within 3D space when the molecule and/or construct is in extended conformation.

6.4 Targeting Moieties

The molecules and compounds described above according to consensus sequences (iii) and (iv) find particular use as transport and/or targeting moieties, specifically recognizing and/or binding proteoglycans, e.g., as expressed on the surface of a cell, in particular, binding the proteoglycan KS selectively over other proteoglycans. The portion of the above consensus sequences (iii) and (iv) acting as the targeting moiety that specifically interact with the KS proteoglycan is represented by

Arg₁-(SP_(A))-Lys₃-(SP_(B))-Arg₅-(SP_(C))-Lys₇.

These targeting moieties include the specific embodiments having consensus sequences or structures comprised exclusively of amino acids, i.e., Arg₁-X₂-Lys₃-X₄-Arg₅-X₆-Lys₇ (SEQ ID NO:5, where a fragment of SEQ ID NO:1 or SEQ ID NO:3; or SEQ ID NO:6, where a fragment of SEQ ID NO:2 or SEQ ID NO:4). These moieties also include the specific embodiments wherein the freely selectable positions (i.e., (SP_(A)), (SP_(B)) and/or (SP_(C))) are replaced by chemical linkers, i.e., Arg₁-(CL_(A))-Lys₃-(CL_(B))-Arg₅-(CL_(C))-Lys₇. The targeting moieties are identical molecules with respect to the 3D presentation of the amino acid residues corresponding to Arg1, Lys3, Arg5, Lys7 in consensus sequences (i) and/or (ii), and, in particular, the sides chains of these residues. Accordingly, any molecule comprising 2 arginine residues and 2 lysine residues according to consensus sequences (iii) or (iv) can be developed as the targeting moiety according to the following rules:

-   -   (1) Each of the residues Arg1, Lys3, Arg5, and Lys7 must have         the same chirality, i.e., each must be an L-amino acid or each         must be a D-amino acid.     -   (2) Arg1 must be linked to Lys3 using one or more chemical         spacers, which spacer may be either (a) any L- or D-amino acid         other than D-arginine, D-lysine; L-arginine or L-lysine; or (b)         a chemical spacer that separates the amino acids by 5.8±1.5 Å         when the molecule is in an extended conformation.     -   (3) Lys3 must be linked to Arg5 using one or more chemical         spacers, which spacer may be either (a) any L- or D-amino acid         other than D-arginine, D-lysine; L-arginine or L-lysine; or (b)         a chemical spacer that separates the amino acids by 4.5±1.5 Å         when the molecule is in an extended conformation.     -   (4) Arg5 must be linked to Lys7 using one or more chemical         spacers, which spacer may be either (a) any L- or D-amino acid         other than D-arginine, D-lysine; L-arginine or L-lysine; or (b)         a chemical spacer that separates the amino acids by 8.5±1.5 Å         when the molecule is in an extended conformation.     -   (5) Where one or more of the linkages between Arg1 and Lys3,         Lys3 and Arg5, and Arg5 and Lys7 are not amino acid residues,         i.e., are chemical spacers or linkages, they may comprise any         chemical spacer or linker known in the art and/or described         herein suitable for maintaining the required separation between         Arg1 and Lys3, between Lys3 and Arg5, and between Arg5 and Lys7         as well as suitable to maintain a 3D orientation and/or 3/D         presentation of these amino acids (in particular, their side         chains) similar to or the same as that in the corresponding         residues in consensus sequences (i) and (ii). The linkages         between Arg1 and Lys3, between Lys3 and Arg5, and between Arg5         and Lys7 should preferentially not be positively charged.         6.5 Chemical Linkers

SP_(A), SP_(B), and SP_(C) in consensus sequences (iii) and (iv) represent chemical linkers/spacers that form part of the backbone of the molecule of the invention and each may comprise or consist of exclusively an amino acid residue or may comprise or consist of one or more chemical linkers (i.e., that are not amino acid residues).

Chemical linkers that are not amino acid residues that are of use according to the methods of the invention may be any appropriate chemical linker or chemical spacer suitable to maintain the spacing and the relative 3D orientation/presentation of the 2 arginine residues (i.e., Arg₁ and Arg₅) and the 2 lysine residues (i.e., Lys₃ and Lys₇) equivalent to the corresponding residues in consensus sequences (i) and (ii). Any such chemical linker/spacer known in the art or described herein may be used in the construction of the molecules of the invention, and it is within the abilities of one of skill in the art to select and adapt such linkers to the methods of the invention. Nonlimiting examples of appropriate chemical linkers/spacers include beta- and gamma-peptides such as those disclosed and described in e.g. Seebach et al., Chem. Biodivers. 1(2004), 1111-1239; sugar amino acid based scaffolds such as those disclosed and described in e.g., Chakraborty et al., Comb. Chem. Hiqh Throughput Screen. 5(2002), 373-387; beta-hairpin peptidomimetics such as those disclosed and described in e.g., Robinson, Acc. Chem. Res. 41(2008), 1278-1288; alpha-helical mimetics, beta-sheet/beta-stand mimetics and beta-turn mimetics such as those disclosed and described in e.g., Hershberger et al., Curr. Top. Med. Chem. 7(2007), 928-924 and cyclotides such as those disclosed and described in e.g., Jaqadish and Camarero, Biopolymers 94(2010), 611-616. In preferred embodiments, the chemical linker/spacers are not positively charged.

6.6 BAM Conjugates

As detailed herein, the molecules and compounds of the invention comprising or consisting of the molecules, peptides and/or polypeptides according to consensus sequences and structures (i) to (vi) are envisioned in preferred embodiments to function as transporter moieties capable of specifically transporting associated cargo molecules/moieties. In preferred embodiments, the molecules and compounds of the invention comprise or consist of the sequences and/or structures according to any of consensus sequences (i) to (vi) further conjugated to a biologically active moiety (BAM); the structure is referenced as a BAM-conjugate throughout this description. The BAM is any moiety known or expected to exhibit a therapeutic effect when administered to an organism, or when introduced to a cell, either in vitro or in vivo. In preferred embodiments, the compounds of the invention comprise or consist of a BAM-conjugate, having a BAM conjugated at the terminus of a consensus sequence or structure (i) to (vi) as described herein. Therefore, in these preferred embodiments, the BAM-conjugate has a consensus structure according to the following consensus structures (vii) to (viii), or the reverses thereof:

(vii) (BAM)-(LINK)-Arg₁-(SP_(A))-Lys₃-(SP_(B))-Arg₅-(SP_(C))- Lys₇-(LD₈)_(n)-(XD₉)_(m); or (viii) (XD₋₂)_(m)-(LD₋₁)_(n)-Arg₁-(SP_(A))-Lys₃-(SP_(B))-Arg₅-(SP_(C))- Lys₇-(LINK)-(BAM) wherein (BAM) represents a biologically active moiety; wherein (LINK) represents an optional linker group; wherein, LD₈ or LD⁻¹ represents any L- or D-amino acid other than D-arginine, D-lysine; L-arginine or L-lysine and n has a value of 0 to 10; wherein XD₉ or XD⁻² represents any D-amino acid other than D-arginine or D-lysine and m has a value of 0 or 1; wherein

-   -   (a) Arg₁ and Arg_(E) represent L-arginine; and Lys₃ and Lys₇         represent L-lysine;     -   or     -   (b) Arg₁ and Arg₅ represent D-arginine; and Lys₃ and Lys₇         represent D-lysine;         wherein (SP_(A)) represents a chemical linker that     -   (a) consists of a single amino acid residue, which may be any L-         or D-amino acid residue other than D-arginine, D-lysine,         L-arginine or L-lysine;     -   or     -   (b) separates the adjacent amino acid residues by 5.8±1.5 Å when         the molecule is in extended conformation;         wherein (SP_(B)) represents a chemical linker that     -   (a) consists of a single amino acid residue, which may be any L-         or D-amino acid residue other than D-arginine, D-lysine,         L-arginine or L-lysine;     -   or     -   (b) separates the adjacent amino acid residues by 4.5±1.5 Å when         the molecule is in extended conformation;         and wherein (SP_(C)) represents a chemical linker that     -   (a) consists of a single amino acid residue, which may be any L-         or D-amino acid residue other than D-arginine, D-lysine,         L-arginine or L-lysine;     -   or     -   (b) separates the adjacent amino acid residues by 8.5±1.5 Å when         the molecule is in extended conformation.         6.6.1 The Optional Linking Group

Consensus sequences (vii) and (viii) are identical to consensus sequences (iii) and (iv), respectively, but for the presence of the BAM-moiety and the optional linker group, (LINK). Therefore, selection of the chemical spacers (SP_(A)), (SP_(B)) and (SP_(C)), which, together, may consist of or comprise exclusively amino acid residues or comprise both chemical linkers and amino acid residues as described herein proceeds as described herein with respect to consensus sequences (iii) and (iv), including

-   -   selection of these chemical spacers to consist of or comprise         exclusively amino acid residues according to the embodiments         (and combinations thereof) of consensus sequences (i) and (ii)         as detailed throughout this disclosure, and     -   selection of these chemical spacers to comprise both chemical         linkers and amino acid residues according to the embodiments         (and combinations thereof) of consensus structures (iii) to (vi)         as detailed throughout this disclosure.

The optional linking group (LINK) may be a peptidic linker. If peptidic linker sequences are used, the linker sequences preferably form a flexible sequence of 2 to 10 residues, more preferably 1 to 5 residues. In a preferred embodiment, the linker sequence contains at least 20%, more preferably at least 40% and even more preferably at least 50% Gly or β-alanine residues. Nonlimiting examples of linking groups include, GlyGlyGlyGlyGly (SEQ ID NO:24), GlyGlyGlyGly (SEQ ID NO:25), GlyGlyGly, CysGlyGly or GlyGlyCys, etc. Appropriate linker sequences are well known to and can be easily selected and prepared by a person skilled in the art. The optional linker group may be composed of D-amino acids, L amino acids, and/or combinations thereof.

Alternatively, the BAM and the transporter moiety may be linked by chemical coupling in any suitable manner known in the art or described herein, such as cross-linking methods. However, attention is drawn to the fact that many known chemical cross-linking methods are non-specific, i.e., they do not direct the point of coupling to any particular site on the carrier/transporter moiety or on the cargo moiety (e.g., BAM). Thus, the use of non-specific cross-linking agents may attack functional sites or sterically block active sites, rendering the one or both of the BAM/transporter components of the inventive BAM-conjugate molecule biologically inactive. It is referred to the knowledge of the skilled artisan to block potentially reactive groups by using appropriate protecting groups. Alternatively, the use of the powerful and versatile oxime and hydrazone ligation techniques, which are chemo-selective entities that can be applied for the cross-linking of component (A) to component (B), may be employed. This linking technology is described, e.g., by Rose et al. (1994), JACS 116, 30.

Coupling specificity can also be increased by direct chemical coupling to a functional group found only once or a few times in the BAM component and the transporter moiety. Coupling of the two components of the inventive peptide-conjugate molecule can be accomplished via a coupling or conjugating agent as is known in the art, including standard (poly-) peptide synthesis coupling reagents such as HOBt, HBTU, DICI, TBTU. There are several intermolecular cross-linking reagents which can be utilized, see, for example, Means and Feeney, Chemical Modification of Proteins, Holden-Day, 1974, pp. 39-43. These reagents include, but are not limited to, N-succinimidyl 3-(2-pyridyldithio)propionate (SPDP) or N,N′-(1,3-phenylene) bismaleimide; N,N′-ethylene-bis-(iodoacetamide) or other such reagent having 6 to 11 carbon methylene bridges; and 1,5-difluoro-2,4-dinitrobenzene. Other cross-linking reagents useful for this purpose include, but are not limited to: p,p′-difluoro-m,m′-dinitrodiphenylsulfone; dimethyl adipimidate; phenol-1,4-disulfonylchloride; hexamethylenediisocyanate or diisothiocyanate, or azophenyl-p-diisocyanate; glutaraldehyde and disdiazobenzidine. Cross-linking reagents may also be homobifunctional, i.e., having two functional groups that undergo the same reaction. A preferred homobifunctional cross-linking reagent is bismaleimidohexane (BMH). BMH contains two maleimide functional groups, which react specifically with sulfhydryl-containing compounds under mild conditions (pH 6.5-7.7). The two maleimide groups are connected by a hydrocarbon chain. Therefore, BMH is useful for irreversible cross-linking of proteins (or polypeptides) that contain cysteine residues. Cross-linking reagents may also be heterobifunctional. Heterobifunctional cross-linking agents have two different functional groups, for example an amine-reactive group and a thiol-reactive group, that will cross-link two proteins having free amines and thiols, respectively. Nonlimiting examples of heterobifunctional cross-linking agents are Succinimidyl-4-(N-maleimidomethyl)-cyclohexane-1-carboxylate (SMCC), m-maleimidobenzoyl-N-hydroxysuccinimide ester (MBS), and succinimide 4-(p-maleimidophenyl)butyrate (SMPB), an extended chain analog of MBS. The succinimidyl group of these cross-linkers reacts with a primary amine, and the thiol-reactive maleimide forms a covalent bond with the thiol of a cysteine residue. Because cross-linking reagents often have low solubility in water, a hydrophilic moiety, such as a sulfonate group, may be added to the cross-linking reagent to improve its water solubility. Sulfo-MBS and sulfo-SMCC are examples of cross-linking reagents modified for water solubility. Many cross-linking reagents yield a conjugate that is essentially non-cleavable under cellular conditions, which would not be preferred. Therefore, some cross-linking reagents contain a covalent bond, such as a disulfide, that is cleavable under cellular conditions. For example, Traut's reagent, dithiobis(succinimidylpropionate) (DSP), and N-succinimidyl 3-(2-pyridyldithio)propionate (SPDP) are well-known cleavable cross-linkers. The use of a cleavable cross-linking reagent permits the cargo moiety, e.g., BAM, to separate from the novel transporter moiety after delivery into the target cell. For this purpose, direct disulfide linkage may also be useful. Chemical cross-linking may also include the use of spacer arms. Spacer arms provide intramolecular flexibility or adjust intramolecular distances between conjugated moieties and thereby may help preserve biological activity. A spacer arm may be in the form of a protein (or polypeptide) moiety that includes spacer amino acids, e.g., proline. Alternatively, a spacer arm may be part of the cross-linking reagent, such as in “long-chain SPDP” (e.g., Pierce Chem. Co., Rockford, Ill., cat. No. 21651 H). Numerous cross-linking reagents, including the ones discussed above, are commercially available. Detailed instructions for their use are readily available from the commercial suppliers. A general reference on protein cross-linking and conjugate preparation is Wonq, Chemistry of Protein Conjugation and Cross-Linking, CRC Press (1991).

It will be appreciated that the different components of the BAM-conjugate molecule should be coupled in a manner so that the different components can still convey at least part of their individual activity and/or properties to the entire transporter cargo conjugate molecule. For example, coupling of the components shall preferably not lead to a total loss in targeting to KS.

6.6.2 The BAM Moiety

The invention encompasses any BAM known or expected to be a therapeutically active component for the treatment or prevention of a disease, disorder or condition, or amelioration of a symptom thereof, e.g., such as chemical compounds, proteins, liposomes, nanoparticles and (poly-)peptides. Additionally or alternately, the BAM may be a detectable marker of use in the diagnosis of a disease or condition such as a fluorescent dye, a radioactive label or a chemoluminesent group.

A. Cytotoxic Drugs

Preferably the BAM of the “BAM-conjugate” of the invention is a pharmaceutical drug, e.g., selected from cytotoxic or anti-tumor drugs which are suitable as a chemotherapy drug. In general, chemotherapy drugs suitable for component (B) can be divided into three main categories based on their mechanism of action. They may

-   (a) stop the synthesis of preDNA molecule buildinq blocks. These     agents work in a number of different ways. DNA building blocks are     folic acid, heterocyclic bases, and nucleotides, which are made     naturally within cells. All of these agents work to block some step     in the formation of nucleotides or deoxyribonucleotides (necessary     for making DNA). When these steps are blocked, the nucleotides,     which are the building blocks of DNA and RNA, cannot be synthesized.     Thus the cells cannot replicate because they cannot make DNA without     the nucleotides. Examples of drugs in this class include     methotrexate (Abitrexate®), fluorouracil (Adrucil®), hydroxyurea     (Hydrea®), and mercaptopurine (Purinethol®), thioguanine, tocoferol,     or, more generally, any nucleotide analogue, e.g., 2′-deoxycytidine     analogues; -   (b) directly damage the DNA in the nucleus of the cell. These agents     chemically damage DNA and RNA. They disrupt replication of the DNA     and either totally halt replication or cause the manufacture of     nonsense DNA or RNA (i.e., the new DNA or RNA does not code for     anything useful). Examples of drugs in this class include cisplatin     (Platinol®) and antibiotics-daunorubicin (Cerubidine®), doxorubicin     (Adriamycin®) belonging to the class of anthracycline antitumor     agents and etoposide (VePesid®) or any intercalator; further     included are radionuclides (e.g., alpha-radionucleids) commonly used     in target cancer treatment (e.g., Bismuth-213); or -   (c) effect the synthesis or breakdown of the mitotic spindles.     Mitotic spindles serve as molecular railroads with “North and South     Poles” in the cell when a cell starts to divide itself into two new     cells. These spindles are very important because they help to split     the newly copied DNA such that a copy goes to each of the two new     cells during cell division. These drugs disrupt the formation of     these spindles and therefore interrupt cell division. Examples of     drugs in this class of mitotic disrupters include: Vinblastine     (Velban®), Vincristine (Oncovin®) and Paclitaxel (Taxol®).

The BAM of the “BAM-conjugate” of the invention may act according to one of the above modes of action. In other terms, each of the classes of anti-tumor drugs, i.e., alkylating agents, nitrosoureas, antimetabolites, plant alkaloids, antitumor antibiotics, and steroid hormones may be used as component the BAM of the inventive transporter cargo conjugate molecule. To describe these drug classes in more detail it is emphasized that each anti-cancer drug may also be categorized according to its effect on the cell cycle and cell chemistry as disclosed above. For example, alkylating agents kill cells by directly attacking DNA.

Alkylating agents may be used, in particular, in the treatment of chronic diseases by targeting diseased cells, e.g., in the treatment of cancers. Cyclophosphamide is an example of a commonly used alkylating agent. Nitrosoureas act similarly to akylating agents and also inhibit changes necessary for DNA repair. These agents cross the blood-brain barrier and are therefore used to treat brain tumors, lymphomas, multiple myeloma, and malignant melanoma. Carmustine and lomustine are the major drugs in this category. Antimetabolites are drugs that block cell growth by interfering with certain activities, usually DNA synthesis. Once ingested into the cell they halt normal development and reproduction. All drugs in this category affect the cell during the “S” phase of the cell cycle. Antimetabolites may be used in the treatment of acute and chronic cancers. Nonlimiting examples of commonly used antimetabolites are 6-mercaptopurine and 5-fluorouracil (5FU). Antitumor antibiotics are a diverse group of compounds. In general, they act by binding with DNA and preventing RNA synthesis. These agents are widely used in the treatment of a variety of cancers. The most commonly used drugs in this group are doxorubicin (Adriamycin), mitomycin-C, and bleomycin. Plant (vinca) alkaloids are anti-tumor agents derived from plants. These drugs act specifically by blocking cell division during mitosis. Steroid hormones may also be useful in treating some types of tumors. This class includes adrenocorticosteroids, estrogens, antiestrogens, progesterones, and androgens. Although their specific mechanism of action is not clear, steroid hormones modify the growth of certain hormone-dependent cancers. Tamoxifen is an example, which is used for estrogen dependent breast cancer. All of the above-mentioned tumor species may be treated by the inventive BAM-conjugate molecules comprising as the BAM any of the above antitumor agents.

One group of cytotoxic or anti-tumor drugs, which may be used as the BAM component of the “BAM-conjugate” of the invention is selected from alkylating drugs, antimetabolica, cytostatics or drugs related to hormone treatment. In this context, it is preferred to select as cytotoxic or anti-tumor drugs compounds of metal, in particular platin (derivative) and taxol classes. In particular, the drug moiety is selected from the group of drugs consisting of, for example, cisplatin, transplatin, satraplatin, oxaliplatin, carboplatin, nedaplatin, chlorambucil, cyclophosphamide, mephalan, azath ioprin, fluorouracil, (6)-mercaptopurine, methrexate, nandrolone, aminogluthemide, medroxyprogesteron, megestrolacetate, procarbazin, docetaxel, paclitaxel, irinotecan, epipodophyllotoxin, podophyllotoxin, vincristine, vinblastine, docetaxel, daunomycin, daunorubicin, doxorubicin, mitoxantrone, topotecan, bleomycin, gemcitabine, fludarabine, navelbine and 5-FUDR. Particularly preferred is the class of metal-containing anticancer drugs, e.g., the class of platinum compounds.

Further cytotoxic or anti-tumor drugs, which may be used as the BAM component of the “BAM-conjugate” of the invention are Alitretinoin, Altretamine, Azathioprine, Bicalutamide, Busulfan, Bortezomib, Capecitabine, Carfilzomib, Cyclophosphamide, Exemestane, Letrozole, Finasteride, Fostamatinib, Gefitinib, Imatinib, Lenalidomide, Marizomib, Megestrol Acetate, Nilotinib, Triptorelin, Temozolomide, Mifepristone, Tretinoin, Tamoxifen, Teniposide, Peplomycin sulfate or the class of camptothecins.

Further cytotoxic or anti-tumor drugs that may be used as the BAM component of the “BAM-conjugate” of the invention are radionuclides (e.g., alpha particle emitting radionucleids) such as those commonly used in target cancer therapies well known in the art. Non-limiting examples of such radionucleids that may be of use according to the invention include Molybdenum-99, Technetium-99m, Bismuth-213, Chromium-51, Cobalt-60, Copper-64, Dysprosium-165, Erbium-169, Holmium-166, Iodine-125, Iodine-131, Iridium-192, Iron-59, Lutetium-177, Palladium-103, Phosphorus-32, Potassium-42, Rhenium-186, Rhenium-188, Samarium-153, Selenium-75, Sodium-24, Strontium-89, Xenon-133, Ytterbium-169, Ytterbium-177, Yttrium-90, Radioisotopes of caesium, gold and ruthenium. Also included are Cyclotron Radioisotopes such as Carbon-11, Nitrogen-13, Oxygen-15, Fluorine-18, Cobalt-57, Gallium-67, Indium-111, Iodine-123, Krypton-81m, Rubidium-82, Strontium-92, Thallium-201. Alternatively or additionally, any of the above non-limiting examples may also be implemented for diagnostic or imaging uses.

Another group of cytotoxic or anti-tumor drugs, which may be used as the BAM component of the “BAM-conjugate” of the invention are indolocarbazole compounds, e.g, staurosporin (and its analogues) and rebeccamycin. It is to be mentioned that compounds belonging to the class of anilinoquinazolines (e.g., gefitinib) are also particularly preferred as the BAM component of the BAM-conjugates of the invention.

A further group of cytotoxic or anti-tumor drugs, which may be used as the BAM component of the “BAM-conjugate” of the invention may be selected from inhibitors of topoisomerases, such as irinotecan, or mitotic kinesins or DHFR.

Additionally, cytotoxic or anti-tumor drugs, which may be used as the BAM component of the “BAM-conjugate” of the invention can be selected from factors inhibiting or stimulating cell proliferation (PDGF), intracellular pathways, e.g., the RAS/RAF signaling pathway, such as a member of the RAF/MEK/ERK signaling pathway (e.g., RAF-1) or mitogen-activated protein kinase pathway, CMGC kinase family (containing CDK (cyclin dependent-kinases), MAPK, GSK3, CLK), Ser/Thr kinases that belong to the AGC kinase family containing PKA, PKG, PKC kinase families, receptor tyrosine kinases involved, e.g., in neovascularization and tumor progression, including vascular endothelial growth factor receptor (VEGFR)-2, VEGFR-3, platelet-derived growth factor receptor β, Flt-3, the endothelin (ET) system, that includes ET-1, ET-2, ET-3, and the ET_(A) receptor (ET_(AR)) and ET_(BR), and c-KIT, which are targeted by, e.g., inhibiting their function, and members of the IGF-family, such as IGF-1, IGF-2, IGF-1R, IGF2R, etc.

Another group of cytotoxic or anti-tumor drugs, which may be used as the BAM component of the “BAM-transporter conjugate” of the invention may be selected from inhibitors that target tumor cell proliferation and tumor angiogenesis. Particularly preferred in this context are small molecule antitumor kinase inhibitors directed toward targets on malignant cells and/or vascular cells have antiangiogenic activity. Kinase inhibitors such as those directed toward EGFR, Her2/neu, BCR-ABL, c-KIT, PKC, Raf and PI3, are antiangiogenic by virtue of blocking secretion of angiogenic factors by affected malignant cells. Kinase inhibitors such as those directed toward VEGFR2, VEGFR1, PDGFR, PKC, Raf and P13, are antiangiogenic by effects on vascular cells. Examples of synthetic inhibitors of cyclin dependent kinases (CDKIs) are, e.g., olomoucine, flavopiridol, butyrolactone and their derivatives and thus constrain tumor cell proliferation. On the other hand, antitumor compounds suitable as the BAM component of the inventive peptide-conjugate/BAM-conjugate molecule may be selected from activators of apoptosis programs in cancer cells (e.g., staurosporine) or by down-regulating antiapoptotic proteins, e.g., Bcl-2.

6.7 The Freely-Selectable Positions

As discussed herein, when the consensus structure of the molecule or compound of the invention according to structures (iii) or (iv) consists of or comprises exclusively amino acid residues, the targeting moiety that specifically interacts with KS is represented by the sequence

Arg₁-X₂-Lys₃-X₄-Arg₅-X₆-Lys₇ (SEQ ID NO:5, where a fragment of SEQ ID NO:1 or SEQ ID NO:3; SEQ ID NO:6, where a fragment of SEQ ID NO:2 or SEQ ID NO:4).

As detailed herein positions 2, 4 and 6 have little effect on specific binding of the compounds of the invention to a KS proteoglycan; thus these positions are referenced herein as freely selectable. Use of the charged amino acids arginine or lysine is avoided in these positions, as the charged residues tend to increase the probability of non-specific interactions with proteoglycans in general.

Additionally, it is believed that the targeting moieties exhibit specificity for KS due to the optimal linear arrangement of the arginines at positions 1 and 5, and the lysines at positions 3 and 7. Therefore, the remaining amino acids are preferably selected so as to avoid the induction of alternate 3-dimensional structures such as, e.g., α-helices. The propensity for α-helix formation for each of the naturally occurring peptidic amino acids is known, e.g., as set forth in Chakrabartty et al., Protein Sci. 3(1994), 843-852. The ranking from highest to lowest α-helix potential is as follows:

Ala >Arg⁺>Leu>Lys⁺>Glu⁰>Met>Gln>Glu⁻>Ile>Tyr>His⁰>Ser>Cys>Asn>Asp⁻>Asp>Trp>Phe>Val>Thr>His⁺>Gly>Pro.

Thus, it is preferred that no more than 3 amino acids with a high probability of forming an α-helix are used in the freely selectable positions. Accordingly, in preferred embodiments, no more than 3 alanines are used in the freely selectable positions. In additional or alternative further preferred embodiments, proline is not to be used in the freely selectable position.

Further it is preferred that the freely selectable positions are bulky and/or hydrophobic amino acids, as such amino acids may aid membrane insertion and cell penetration (see, e.g., Rothbard, Cell-Penetratinq Peptides: Processes and Applications. Lanel Ü (Ed, CRC Press (2000)). Thus, in preferred embodiments wherein the molecule or compound of the invention consists or comprises a sequence according to consensus sequences (i) or (ii), the freely selectable positions, 2, 4, and 6 are independently selected from Phe, Trp, Tyr, Val, Met, lie and Leu. It is also known that D-amino acids enhance resistance to proteases and/or other means of protein degradation. Thus, in further preferred embodiments, which may be combined with any other embodiment disclosed herein (whether preferred or not), position 4 or position 6 is a D-amino acid.

It is also noted that the consensus sequence of the compounds of the invention contains optional amino acid residues that act as a proton sponge. In the consensus sequences (i) to (viii), (XD₉)_(m) or (XD⁻²)_(m) and (LD₈)_(n) or (LD⁻¹)_(n), wherein m is 0 or 1 and n is 0-10, represent the optional proton sponge amino acids. The residues of the proton sponge may be selected from among any residue or combination of residues that is(are) known in the art or described herein capable of acting as a proton sponge. In preferred embodiments, XD₉ or XD⁻² (if the residue is present) represents any D-amino acid other than D-arginine or D-lysine and LD₈ or LD⁻¹ (if the residue(s) are present) represents any L- or D-amino acid other than L- or D-arginine or L- or D-lysine. In the most preferred embodiments, XD₉ or XD⁻² (if the residue is present) represents D-histidine, and LD₈ or LD⁻¹ (if the residue(s) are present) represents L- or D-histidine. As is recognized in the art, the accumulation of specific amino acid residues (e.g., histidine) inside acidic vesicles such as an endosome produces the protein sponge effect. For example, the imidazole ring of histidine is a weak base that has the ability to acquire a cationic charge when the pH of the environment drops below 6, which may further promote membrane fusion and endosome escape.

It is expressly contemplated that the embodiments described in this section and throughout this application, whether identified as preferred (including most preferred) or not, can be independently implemented or can be combined with the other disclosed embodiments in the design or selection of a compound of the invention. Therefore, the consensus sequence and/or targeting moiety of the compound may optionally satisfy none, one, or more than one of the embodiments outlined in this section and/or throughout the description. As a non-limiting example of such combination, the freely selectable positions may each be independently selected from Phe, Trp, Tyr, Val, Met, lie and Leu, position 4 may be a D-amino acid, n of (LD₁₀)_(n) may be 0 and m of (XD₁₁)_(m) may be 1.

The novel D/L amino acid configuration and/or novel combination of chemical linkers/spacers and/or combination of the compounds of the invention allow a skilled person to define the in vivo or in vitro persistence of the inventive molecule or compound as defined above in the cell, specifically, with a half-life that closely matches the half-life of, e.g., the BAM to be administered into the cell or nucleus prior to degradation of the transporter construct by proteases.

In the context of the present invention, variants and/or fragments of peptides and/or polypeptides preferably comprise or consist of a (poly-)peptide sequence having at least 10%, 20%, 30%, 40%, 50%, 60%, 70%, 80%, or 85%, preferably at least 90%, more preferably at least 95% and most preferably at least 99% sequence identity over the whole length to the sequence of the (poly-)peptides expressly disclosed herein and subject to the rules for selection of the consensus sequence(s).

6.8 Use of Compounds of the Invention

The consensus sequences (i) to (vi) of the invention are envisioned as transporter moieties that effect transport across the cell membrane and into the cytoplasm and/or nucleus of a cell. It is common to all of the therapeutic compounds (i.e., BAMs) disclosed herein that they have to cross the cell membrane in order to achieve therapeutic effect. By coupling compounds belonging to these classes (compounds directly damaging the DNA in the nucleus of the cell, effecting the synthesis or breakdown of the mitotic spindles or stopping the synthesis of pre-DNA molecule building blocks, or having other therapeutic effects) to the consensus sequences (i) to (vi) of the invention, the entry of the BAM moieties into the cell is enhanced and/or their solubility is enhanced, thereby increasing the efficacy of these therapeutic compounds. In turn, increased cell take-up and, preferably, better solubility of these compounds in the aqueous environment (e.g., the cytosol) allows to lower the dosage of the therapeutic anti-cancer compound may be achieved.

The consensus sequences of the targeting moieties of the compounds of the invention target, in particular, the KS proteoglycan. These proteoglycans are preferentially expressed in conjunction with the surface proteins of cells of the eye, central nervous system and endometrium. Accordingly, the molecules and compounds of the invention allow the specific targeting of such cell types. Therefore, the molecules, compounds and methods of the invention facilitate the efficient delivery of cargo moieties, e.g., BAMs, to cells and/or tissues of the eye, central nervous system and/or endometrium and provides a general means of selectively delivering a substance of interest into these cells and/or tissues. Thus, it is preferred that the BAM to be conjugated to the transporter moiety be any substance the person skilled in the art knows or expects to exert an effect on the activity of such cells, including therapeutic effects (such as treating, preventing, attenuating or ameliorating a disease), or for the purpose of labeling such cells, e.g., for diagnostic purposes or for purposes of scientific research.

The BAM-conjugates of the invention may be used for the treatment, amelioration, prophylaxis, or diagnosis of a wide range of diseases or conditions, or the symptoms thereof, in which ocular tissues, in particular corneal cells, are pathophysiologically involved and/or have a pathophysiological involvement. Non-limiting exemplary diseases or conditions which may be treated include corneal ulcer, corneal flash burn, corneal abrasion, corneal neovascularization, keratitis, ocular herpes, pterygium, trachoma, corneal dystrophy, Fuch's dystrophy, iridocorneal endothelial syndrome, lattice dystrophy, map-dot-fingerprint dystrophy, corneal infection, ocular herpes, and dry eye. The molecules of the invention may also be used to target other diseases or conditions affecting other tissues of the eye, such as diseases involving the retina. Non-limiting exemplary retinal diseases or conditions which may be treated include age-related macular degeneration, diabetic retinopathy, retinitis pigmentosa, retinal degeneration, macular hole, retinoblastoma, glaucoma and onchocerciasis.

The BAM-conjugates of the invention also preferentially target cells of the central nervous system and, thus, may be used for the treatment, amelioration, prophylaxis, or diagnosis of a wide range of diseases or conditions in which these cells are pathophysiologically involved and/or have a pathophysiological involvement. Non-limiting exemplary diseases or conditions which may be treated include anxiety disorders, depression, epilepsy, spinal muscular atrophy, amyotrophic lateral sclerosis, Alzheimer's disease, autism, Ataxia Telangiectasia, Niemann Pick disease Type C, cerebellar degeneration, Machado-Joseph disease, olivopontocerebella atrophy, spinocerebella ataxias, sporadic ataxia, multiple system atrophy, Parkinson's disease, Parkison's syndrome, gait ataxia, herpes zoster, viral encephalitis, Japanese encephalitis, bacterial encephalitis, toxoplasmosis, malaria, amoebic meningoencephalitis, Cryptococcus neoformans encephalitis, Lyme disease, streptococci meningoencephalitis, staphylococci meningoencephalitis, astrocytoma, glioblastoma, oligodendroglioma, ependymoma, medulloblastoma and meningioma.

The BAM-conjugates may also be of use in the treatment of diseases or conditions involving the altered production or metabolism of KS, such as in a KS storage disease, including but not limited to morquio syndrome.

The BAM-conjugates of the invention may also target cells of the endometrial lining of the uterus and, thus, may be used for the treatment, amelioration, prophylaxis, or diagnosis of a wide range of diseases, conditions or physiological functions in which these cells are pathophysiologically involved and/or have a physiological involvement.

Non-limiting exemplary diseases, conditions or physiological functions which may be treated and/or diagnosed include endometrial cancer, endometrial hyperplasia, endometrial polyp, uterine bleeding, endometriosis, adenomyosis, cervicitis, infertility, contraception and regulation of the menstrual cycle.

6.9 Pharmaceutical Compositions and their Administration

The invention also relates to pharmaceutical compositions comprising the molecules and compounds of the invention, e.g., BAM-conjugates. Such compositions comprise a therapeutically effective amount of the compound, and a pharmaceutically acceptable carrier or excipient. Such a carrier or excipient includes, but is not limited to, saline, buffered saline, dextrose, water, glycerol, ethanol, and/or combinations thereof. The pharmaceutical compositions also may include additional therapeutic agents for the treatment of the given disease being treated. The formulation is made to suit the mode of administration. In general, methods of administering polypeptides are well known in the art and can be applied to administration of the conjugates of the invention.

Administration is by any of the routes normally used for introducing a molecule into ultimate contact with blood or the relevant target tissue as known in the art or described herein. Suitable methods of administering such conjugates in the context of the present invention to a patient are available including oral and parenteral routes. Although more than one route can be used to administer a particular composition, a particular route can often provide a more immediate and more effective action or reaction than another route.

Preferably the BAM-conjugates of the invention are administered by parenteral modes of administration, in particular by intravenous, intraperitoneal, intramuscular, intradermal, subcutaneous intrathecal, intraocular, retrobulbar, intrapulmonary or intraarticular means. Such administration routes and appropriate formulations are generally known to those of skill in the art. Formulations suitable for parenteral administration include aqueous and non-aqueous, isotonic sterile injection solutions, which can contain antioxidants, buffers, bacteriostats, and solutes that render the formulation isotonic with the blood of the intended recipient, and aqueous and non-aqueous sterile suspensions that can include suspending agents, solubilises, thickening agents, stabilizers, and preservatives. Carrier polypeptide-target polypeptide conjugates can also be administered via liposomes.

The molecules and compounds of the invention, alone or in combination with other suitable components, can also be made into aerosol formulations (i.e., they can be “nebulised”) to be administered via inhalation. Aerosol formulations can be placed into pressurized acceptable propellants, such as dichlorodifluoromethane, propane, nitrogen, and the like.

In a preferred embodiment, the pharmaceutical compositions of the invention are provided in lyophilized form to be reconstituted prior to administration. Buffers and solutions for the reconstitution of the pharmaceutical compositions may be provided along with the pharmaceutical formulation to produce aqueous compositions of the present invention for administration.

The phrase “pharmaceutically or pharmacologically acceptable” refers to molecular entities and compositions that do not produce adverse, allergic, or other untoward reactions when administered to an animal or a human. As used herein, “pharmaceutically acceptable carrier” includes any and all solvents, dispersion media, antibacterial and antifungal agents, isotonic and absorption delaying agents and the like.

Pharmaceutically acceptable carriers are determined in part by the particular composition being administered, as well as by the particular method used to administer the composition. Accordingly, there are a wide variety of suitable formulations of pharmaceutical compositions of the present invention. Pharmaceutically acceptable carriers and excipients are well known in the art, and one or more conjugates of the invention can be formulated into pharmaceutical compositions by well-known methods (see, e.g., Remington: The Science and Practice of Pharmacy, 21st edition, A. R. Gennaro, Ed., Mack Publishing Company (2005); Pharmaceutical Formulation Development of Peptides and Proteins, S. Frokjaer and L. Hovgaard, Eds., Taylor & Francis (2000); and Handbook of Pharmaceutical Excipients, 3rd edition, A. Kibbe, Ed., Pharmaceutical Press (2000)).

Pharmaceutical compositions comprising one or more BAM-conjugates of the invention are optionally tested in one or more appropriate in vitro and/or in vivo animal models of disease, to confirm efficacy, tissue metabolism, and to estimate dosages, according to methods well known in the art. Thus, it is understood that the suitable dose of a composition according to the present invention will depend upon the age, health and weight of the recipient, kind of concurrent treatment, if any, frequency of treatment, and the nature of the effect desired. However, the dosage is tailored to the individual subject, as is determinable by one of skill in the art, without undue experimentation. The total dose of therapeutic agent may be administered in multiple doses or in a single dose. In certain embodiments, the compositions are administered alone, in other embodiments the compositions are administered in conjunction with other therapeutics directed to the disease or directed to other symptoms thereof.

The dose administered to a patient, in the context of the present invention, is sufficient to effect a beneficial therapeutic response in the patient over time, or, e.g., to inhibit infection by a pathogen, to reduce or prevent the symptoms of a disease state, or other appropriate activity, depending on the application. The dose is determined by the efficacy of a particular composition/formulation, and the activity, stability or serum half-life of the BAM polypeptide conjugate employed and the condition of the patient, as well as the body weight or surface area of the patient to be treated. The size of the dose is also determined by the existence, nature, and extent of any adverse side-effects that accompany the administration of a particular composition/formulation, or the like in a particular patient.

The present invention is not to be limited in scope by the specific embodiments described herein. Indeed, various modifications of the invention in addition to those described herein will become apparent to those skilled in the art from the foregoing description and accompanying figures. Such modifications fall within the scope of the appended claims.

Various publications are cited herein, the disclosures of which are incorporated by reference in their entirety.

7. EXAMPLES

7.1 Modeling Binding of TAT Peptides to KS

Approximation of the TAT peptide interaction with KS was performed by computer-aided molecular modeling. The simulation modeled the TAT peptide and KS molecules in extended conformation. It was hypothesized that the Arg residues of TAT corresponding to arginines at positions 1 and 5 of a seven-mer peptides according to the invention (e.g., according to the sequence Arg1-X2-Lys3-X4-Arg5-X6-Lys7 (SEQ ID NO:5, where a fragment of SEQ ID NO:1 or SEQ ID NO:3; SEQ ID NO:6, where a fragment of SEQ ID NO:2 or SEQ ID NO:4) were spaced at favorable intervals to optimally interact with the repeating sulfates of KS. In particular, initial comparison of the 3D structures of a TAT peptide according to the invention and KS, in their extended conformations, revealed that the average distance between side chains n, n+2 of TAT peptide is very close to that of the repeating sulfates of residues n, n+1 and n+2 of KS; see, e.g., (FIG. 1).

Additional studies revealed that TAT and KS likely bind due to the favorable electrostatic interactions between the pair of residues 1/n; 3/n+1; 5/n+2; and 7/n+4 of TAT and KS, respectively. In the recited A/B pairs, A and B designate residues of TAT and KS; see, e.g., FIG. 1, providing schematics of TAT and KS molecules wherein the residues are indicated according to the numbering used herein (see, also FIGS. 2 and 3, providing enlarged schematics of the TAT peptide and KS molecule (in part) of FIG. 1). The favorable interactions between these sets of residues may be explained by 2 mechanisms (which are not necessarily mutually exclusive), depending on whether Arg1 of TAT would bind to a sugar moiety bearing a sulfate or a carboxylate group.

Due to the nature, the size and the flexibility of the two partners, it was not possible to identify the most likely binding mode using docking software. Therefore, the following strategy was implemented: (i) extended 3D conformers of both molecules were used as starting input; (ii) the molecules were manually positioned to form the 3 adjacent contacts as described above (e.g. 3/n+1, 5/n+2, 7/n+3); (iii) a total of 600 ns of Molecular Dynamics (MD) simulations at 300 K was performed while maintaining the chosen triplet of interactions; and, eventually, (iv) determining whether new interactions would emerge during the MD simulations, and at which frequency. Importantly, at the start of the MD simulation, no interactions other than the chosen starting interactions occur between TAT and KS.

The reasoning supporting this approach is that, if the envisioned interaction scheme constituted by the four major pairs of interacting residues is correct, and if, for instance Arg1 of TAT binds preferably to a sulfated moiety of KS, then, a MD simulation starting from a complex with only an interacting triplet (e.g., 1/n, 3/n+1 and 5/n+2) should lead to the spontaneous formation of the additional interactions, e.g., 7/n+4, but only if residue n of KS is a sulfated residue.

The interactions between TAT and KS were modeled using the GROMACS molecular modeling package. The TAT peptide (sequence: RKKRRQRR (SEQ ID NO:20)) was described using the CHARMM22 all-hydrogen force fields (Mackerell et al., J. Phsy. Chem. B 102(1998), 3586-3616), while topology and parameters were obtained using SwissParam for the KS molecule (Zoete et al., J. Comput. Chem. 32(2011), 2359-2368). The KS molecule was modeled to contain 10 sugar moieties, as schematically represented in FIG. 1.

The starting points of the MD simulation were generated by manually positioning TAT and KS to form the interaction schemes of interest, i.e., involving three consecutive residues of KS and 3 residues of TAT (for example, residues 1,3,5 or 3,5,7). To maintain the interaction between the chosen TAT/KS residue pairs during the MD simulation, a “NOE” distance restraint was applied with a minimal and maximal distance of 5 and 7 Å, respectively; and a force constant set at 1000 kcal mol⁻¹ Å⁻². The system was then minimized using 5000 steps of ABNR. Finally, for each starting interaction scheme, a pool of 30 independent MD simulations, each 20 ns in length, was performed at 300K to check whether additional interactions would take place between KS and TAT. The solvent effect was estimated using a Generalized Born implicit salvation model, with a dielectric constant of 1 for the solute and 80 for the solvent. For each MD simulation, 1000 frames, regularly separated, were extracted from the trajectory file. For each frame, pairs of interacting residues were identified as those having heavy atoms at a distance smaller than 5 Å. Finally, the frequency of existence of each possible pair of interacting residues was averaged over all frames extracted from the 30 MD simulations of each pool.

Several other groups of simulations were performed, each using different starting interaction schemes. In all cases where the scheme involved the interaction of Arg1 with a sulfated residue of KS, nearly all the interactions expected according to our above-mentioned hypothesis occurred spontaneously during the MD simulation. In contrast, the expected interactions did not occur when the starting scheme involved an interaction between Arg1 and a non-sulfated KS residue. The results of the simulations support the hypothesis that TAT and KS interact due to favorable electrostatic interactions between the pair of residues 1/n, 3/n+1, 5/n+2, and 7/n+4, where residues n, n+2 and n+4 of KS are sulfated moieties, and residue n+1 and n+3 are non-sulfated moieties.

As predicted by the model results described above, the simulation results indicated that residues at positions 1, 5 and 7 of the seven-mer peptides according to the invention exhibited favorable interactions with the sufates on the sugars of KS. Additionally, a residue at position 3 of the seven-mer peptide was able to interact with the carbinol function on the sugar at position 2 of the KS.

7.2 Binding of TAT Peptides to KS Evaluated by ELISA

The binding of the seven-mer peptides of the invention to KS as predicted by the models, in particular, as compared to the binding to other proteoglycans was tested using a modified ELISA protocol. ELISA plates were coated overnight with 1 mg/ml of heparin sulfate (HS), chondroitin-4-sulfate (C4S), chondroitin-6-sulfate (C6S) or keratan-sulfate (KS) in PBS. The wells were then washed with PBS and incubated for 1 hour at room temperature with 1 mg/ml of an exemplary peptide of the invention having the sequence RYKvRIK (“RYK-KS”; SEQ ID NO:7) or a standard TAT peptide. The plates were then thoroughly washed with PBS and subsequently photographed with a LUMASCOPE™ luminescent microscope. Images were then quantified using the ImageJ software (FIG. 4). The results presented in FIG. 4 demonstrate, that in contrast to the standard TAT peptide (FIG. 4B), the RYK-KS peptide (SEQ ID NO:7) according to the invention specifically binds to the KS proteoglycan, and in particular, selectively binds KS over the other proteoglycans tested (FIG. 4A).

The same modified ELISA protocol was used to examine the impact of modifying the freely-selectable positions 2, 4 and 6 on the specific binding to KS. FIG. 5A demonstrates that the amino acid residue at these positions had little impact on specific binding, but that, as expected, the presence of positively charged residues, e.g., arginine and/or lysine, at these positions correlated with increases in binding that were likely non-selective and due to non-specific interactions; see, e.g., FIG. 5A, the binding of the standard TAT peptide RKKRRQRR (SEQ ID NO:20) and the pure arginine 9-mer, RRRRRRRRR (SEQ ID NO:21). Additionally, the use of proline at position 6 eliminated binding activity; see, FIG. 5A, the binding of RYKvRPKYF (SEQ ID NO:22). Similarly, deviation from the consensus sequence of (i) or (ii) and/or the use of proline also eliminated binding activity; see, FIG. 5A, the binding of RYKvFIKPRh (SEQ ID NO:23).

The impact of sequence variation on specific binding was also investigated with select peptides using the above described modified ELISA. As seen in FIG. 5B, the control consensus peptide of the invention “RYRI” (referenced as RYK-KS in 5a; RYKvRIK (SEQ ID NO:7)) exhibited strong and selective binding to KS. FIG. 5B also demonstrates that the use of proline in the consensus sequence not only eliminated binding to KS, but to the other tested proteoglycans as well, see, FIG. 5B, the binding of sequence RPKHRPK (SEQ ID NO:26). FIG. 5B also shows that the 8-mer peptide IRIkIrIK (SEQ ID NO:27), as expected, did not exhibit specifc and selective binding for KS due to deviation from the consensus selection rules. In particular, each of Arg1, Lys3, Arg5 and Lys7 in peptide IRIklrlK, do not exhibit the same chirality.

7.3 Tissue Expression of Genes Involved in Production of Ks

The genes CHST1 and CHST6 are the primary genes involved in the sulfation of keratan to produce keratan-sulfate. Expression data from Atlas arrays revealed that the CHST1 and CHST6 genes are preferentially expressed in cells of the central nervous system and eye tissue (including lacrimal gland, iris and cornea) (FIGS. 6 and 7). Keratan-sulfate is also known to be expressed and/or has been observed in the endometrium during the hormonal cycle; see, e.g., Graham et al., Hum. Reprod. 9(1995), 926-930. Accordingly, the peptides and methods of the invention preferentially allow the targeting of these tissues, and the treatment or diagnosis of diseases, conditions, or physiological functions wherein these cells have pathophysiological involvement or a physiological function.

7.4 In Vitro Activity of Consensus Sequence Peptides

The activity of peptides of the invention as targeting moieties was tested in vitro in preliminary studies using cells lines expressing KS. The nucleoside analog 3′ amino deoxythymidine (3′AdT) was conjugated to the amino-terminus of the peptide “RYRI” (RYKvRlK (SEQ ID NO:7)) using a single D-Glu residue (3′AdT-e-RYRI). Previous work (data not shown) revealed that 3′AdT was inactive as a cytostatic when added alone to culture medium.

Confluent cultures of MDCK were passaged at 1/20 dilution and exponentionally growing cells were incubated for 96 hours with concentrations of the nucleoside analog conjugates ranging from 0.5 μM to 10 μM. Viability was subsequently assessed using trypan blue exclusion.

FIG. 8 demonstrates a clear dose response. The D-linked conjugate had substantial effects on viability at concentrations at least ≥0.5 μM.

7.5 Estimation of In Vivo Stability

In vivo stability was estimated by mixing peptides with fetal calf serum and incubating at 37° C. for 6 hr. Samples were tested at both 0 and 6 hours. Test peptide RPKHRPK (SEQ ID NO:26) was completely degraded by 6 hrs. In contrast, no degradation of the RYRI peptide (RYKvRIK (SEQ ID NO:7)) or the IRIkIrIK (SEQ ID NO:27) was detected at 6 hours, i.e., both were 100% stable at 6 hours. The results indicate that the inclusion of a D-amino acid residue significantly improves resistance of the compound of the invention to protease activity. 

The invention claimed is:
 1. A molecule or compound comprising one of the following consensus structures (I) or (II): (I) Arg₁-(SP_(A))-Lys₃-(SP_(B))-Arg₅-(SP_(C))-Lys₇-(LD₈)_(n)- (XD₉)_(m); or (II) (XD₋₂)_(m)-(LD₋₁)_(n)-Arg₁-(SP_(A))-Lys₃-(SP_(B))-Arg₅-(SP_(C))- Lys₇;

(a) wherein, LD₈ or LD⁻¹ represents any L- or D-amino acid other than D-arginine, D-lysine; L-arginine or L-lysine and n has a value of 0 to 10; (b) wherein XD₉ or XD⁻² represents any D-amino acid other than D-arginine or D-lysine and m has a value of 0 or 1; (c) wherein (i) Arg₁ and Arg₅ represent L-arginine; and Lys₃ and Lys₇ represent L-lysine; or (ii) Arg₁ and Arg₅ represent D-arginine; and Lys₃ and Lys₇ represent D-lysine; (d) wherein (SP_(A)) represents a chemical linker that (i) consists of a single amino acid residue, which may be any L- or D-amino acid other than D-arginine, D-lysine, L-arginine, L-lysine; or proline; or (ii) separates the adjacent amino acid residues by 5.8±1.5 Å; (e) wherein (SP_(B)) represents a chemical linker that (i) consists of a single amino acid residue, which is any L- or D-amino acid other than D-arginine, D-lysine, L-arginine, L-lysine; or proline; or (ii) separates the adjacent amino acid residues by 4.5±1.5 Å; and (f) wherein (SP_(C)) represents a chemical linker that (i) consists of a single amino acid residue, which may be any L- or D-amino acid other than D-arginine, D-lysine, L-arginine, L-lysine; or proline; or (ii) that separates the adjacent amino acid by 8.5±1.5 Å; (g) wherein where (SP_(A)), (SP_(B)) and (SP_(C)) each represent a single amino acid residue that is independently selected from any L- or D-amino acid residue other than D-arginine, D-lysine, L-arginine or L-lysine; (SP_(A)), (SP_(B)) and (SP_(C)) together comprise at least one L-amino acid residue and at least one D-amino acid residue.
 2. The molecule or compound according to claim 1, wherein (SP_(A)) represents a chemical linker that separates the adjacent amino acid residues by 5.8 ±1.5 Å; (SP_(B)) represents a chemical linker that separates the adjacent amino acid residues by 4.5±1.5 Å; and (SP_(C)) represents a chemical linker that separates the adjacent amino acid residues by 8.5 ±1.5 Å.
 3. The molecule or compound according to claim 1, wherein one or more of said chemical linkers is a beta-peptide, a sugar-amino acid based scaffold, a beta-hairpin peptidomimetic, a alpha-helical mimetic or a cyclotide.
 4. The molecule or compound according to claim 1, wherein (SP_(A)), (SP_(B)) and (SP_(C)) each represent a single amino acid residue that is independently selected from any L- or D-amino acid residue other than D-arginine, D-lysine, L-arginine or L-lysine, and wherein none of (SP_(A)), (SP_(B)) or (SP_(C)) is a proline residue.
 5. The molecule or compound according to claim 1, wherein each of (SP_(A)), (SP_(B)) and (SP_(C)) are not alanine.
 6. The molecule or compound according to claim 1, wherein neither LD₈ nor LD⁻¹ represents a proline residue.
 7. The molecule or compound according to claim 1, wherein one or more of (SP_(A)), (SP_(B)) and (SP_(C)) are independently selected from the group consisting of Ile, Tyr, Phe, Trp, Val, Met, and Leu.
 8. The molecule or compound according to claim 1, wherein (SP_(B)) or (SP_(C)) represents a D-amino acid.
 9. The molecule or compound according to claim 1, wherein LD₈ or LD⁻¹ independently represents L- or D-histidine.
 10. The molecule or compound according to claim 1, wherein XD₉ or XD⁻² represents D-histidine.
 11. The molecule or compound according to claim 1, wherein m has a value of 1 and/or n has a value of
 0. 12. The molecule or compound according to claim 1, wherein said consensus structure consists of an amino acid sequence selected from the group consisting of: (SEQ ID NO: 7) RYKvRIK; (SEQ ID NO: 8) RYKvRIKYF; (SEQ ID NO: 9) RYKvRIKYFh; (SEQ ID NO: 10) RYKvRIKYA; (SEQ ID NO: 11) RYKvRIKYAh; (SEQ ID NO: 12) RYKvRIKHH; (SEQ ID NO: 13) RYKvRIKHHh; (SEQ ID NO: 14) RMKiRVK; (SEQ ID NO: 15) RMKiRVKFM; (SEQ ID NO: 16) RMKiRVKFMh; (SEQ ID NO: 17) RLKlRLKLL; (SEQ ID NO: 18) RLKlRLKLLh; and (SEQ ID NO: 19) RYKvRIKYr.


13. A conjugate comprising a biologically active moiety (BAM), and a linear consensus structure, wherein said BAM is conjugated to the terminus of said linear consensus structure, and wherein said conjugate comprises one of the following formula: (VII) (BAM)-(LINK)-Arg₁-(SP_(A))-Lys₃-(SP_(B))-Arg₅-(SP_(C))- Lys₇-(LD₈)_(n)-(XD₉)_(m); or (VIII) (XD₋₂)_(m)-(LD₋₁)_(n)-Arg₁-(SP_(A))-Lys₃-(SP_(B))-Arg₅-(SP_(C))- Lys₇-(LINK)-(BAM);

(a) wherein (BAM) represents a biologically active moiety; (b) wherein (LINK) represents an optional linker group; (c) wherein, LD₈ or LD⁻¹ represents any L- or D-amino acid other than D-arginine, D-lysine; L-arginine or L-lysine and n has a value of 0 to 10; (d) wherein XD₉ or XD⁻² represents any D-amino acid other than D-arginine or D-lysine and m has a value of 0 or 1; (e) wherein (i) Arg₁ and Arg₅ represent L-arginine; and Lys₃ and Lys₇ represent L-lysine; or (ii) Arg₁ and Arg₅ represent D-arginine; and Lys₃ and Lys₇ represent D-lysine; (f) wherein (SP_(A)) represents a chemical linker that (i) consists of a single amino acid residue, which may be any L- or D-amino acid other than D-arginine, D-lysine, L-arginine, L-lysine, or proline; or (ii) separates the adjacent amino acid residues by 5.8±1.5 Å; (g) wherein (SP_(B)) represents a chemical linker that (i) consists of a single amino acid residue, which may be any L- or D-amino acid other than D-arginine, D-lysine, L-arginine, L-lysine, or proline; or (ii) separates the adjacent amino acid residues by 4.5±1.5 Å; (h) wherein (SP_(C)) represents a chemical linker that (i) consists of a single amino acid residue, which may be any L- or D-amino acid other than D-arginine, D-lysine, L-arginine, L-lysine, or proline; or (ii) that separates the adjacent amino acid by 8.5±1.5 Å; and (i) wherein where (SP_(A)), (SP_(B)) and (SP_(C)) each represents a single amino acid residue that is independently selected from any L- or D-amino acid residue other than D-arginine, D-lysine, L-arginine or L-lysine; (SP_(A)), (SP_(B)) and (SP_(C)) together comprise at least one L-amino acid residue and at least one D-amino acid residue.
 14. The conjugate of claim 13, wherein the linker group (LINK) is L- or D-Glu.
 15. The conjugate of claim 13, wherein said BAM is conjugated to the Arg₁ by a covalent N-terminus amide- or to Lys₇ by a C-terminus ester bond.
 16. The conjugate of claim 13, wherein said BAM is conjugated to one of (SP_(A)), (SP_(B)) or (SP_(C)).
 17. The BAM-conjugate of claim 16, wherein said conjugation is indirect conjugation via a linker group.
 18. The conjugate of claim 16, wherein said conjugation is direct conjugation.
 19. A pharmaceutical composition comprising the conjugate according to claim
 13. 